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DESCRIPTION 

PLANT ARTIFICIAL CHROMOSOME (PLAC) COMPOSITIONS AND METHODS 

The government owns rights in the present invention pursuant to U.S. Department of 
Agriculture Grant No. 96-35304-3491 and Grant No. DE-FC05-920R22072 from the 
Consortium for Plant Biotechnology. This application claims the priority of U.S. Provisional 
Patent Application Serial No. 60/048,451, filed June 3, 1997; and U.S. Provisional Patent 
5 Application Serial No. 60/073,741, filed February 5, 1998, both of the disclosures of which are 
specifically incorporated herein by reference in their entirety. 

I. Field of the Invention 

The present invention relates generally to the field of molecular biology. More 
particularly, it concerns the construction and use of plant artificial chromosomes. 

10 II. Description of Related Art 

Two general approaches are used for introduction of new genetic information 
("transformation") into cells. One approach is to introduce the new genetic information as part 
of another DNA molecule, referred to as a "vector," which can be maintained as an independent 
unit (an episome) apart from the chromosomal DNA molecule(s). Episomal vectors contain all 

1 5 the necessary DNA sequence elements required for DNA replication and maintenance of the 
vector within the cell. Many episomal vectors are available for use in bacterial cells (for 
example, see Maniatis et ai 9 1982). However, only a few episomal vectors that function in 
higher eukaryotic cells have been developed. The available higher eukaryotic episomal vectors 
are based on naturally occurring viruses and most function only in mammalian cells (Willard, 

20 1997). In higher plant systems the only known double-stranded DNA viruses that replicate 
through a double-stranded intermediate upon which an episomal vector could be based is the 
gemini virus, although the gemini virus is limited to an approximately 800 bp insert. Although 
an episomal plant vector based on the Cauliflower Mosaic Virus has been developed, its 
capacity to carry new genetic information also is limited (Brisson et al. y 1984). 

25 The other general method of genetic transformation involves integration of introduced 

DNA sequences into the recipient cell's chromosomes, permitting the new information to be 
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The essential elements are: (1) Autonomous Replication Sequences (ARS) (these have 
properties of replication origins, which are the sites for initiation of DNA replication), (2) 
Centromeres (site of kinetochore assembly and responsible for proper distribution of replicated 
chromosomes at mitosis or meiosis), and (3) Telomeres (specialized structures at the ends of 
5 linear chromosomes that function to stabilize the ends and facilitate the complete replication of 
the extreme termini of the DNA molecule). 

At present, the essential chromosomal elements for construction of artificial 
chromosomes have been precisely characterized only from lower eukaryotic species. ARSs 
have been isolated from unicellular fungi, including Saccharomyces cerevisiae (brewer's yeast) 
10 and Schizosaccharomyces pombe (see Stinchcomb et aL, 1979 and Hsiao et aL y 1979). ARSs 
behave like replication origins allowing DNA molecules that contain the ARS to be replicated 
as an episome after introduction into the cell nuclei of these fungi. Plasmids containing these 
sequences replicate, but in the absence of a centromere they are partitioned randomly into 
daughter cells. 

1 5 Artificial chromosomes have been constructed in yeast using the three cloned essential 

chromosomal elements. Murray et al. 9 1983, disclose a cloning system based on the in vitro 
construction of linear DNA molecules that can be transformed into yeast, where they are 
maintained as artificial chromosomes. These yeast artificial chromosomes (YACs) contain 
cloned genes, origins of replication, centromeres and telomeres and are segregated in daughter 

20 cells with high affinity when the YAC is at least 100 kB in length. Smaller CEN containing 
vectors may be stably segregated, however, when in circular form. 

None of the essential components identified in unicellular organisms, however, function 
in higher eukaryotic systems. For example, a yeast CEN sequence will not confer stable 
inheritance upon vectors transformed into higher eukaryotes. While such DNA fragments can 
25 be readily be introduced, they do not stably exist as episomes in the host cell. This has 
seriously hampered efforts to produce artificial chromosomes in higher organisms. 

In one case, a plant artificial chromosome was discussed (Richards et aL, U.S. Patent 
No. 5,270,201). However, this vector was based on plant telomeres, as a functional plant 
centromere was not disclosed. While telomeres are important in maintaining the stability of 
30 chromosomal termini, they do not encode the information needed to ensure stable inheritance of 
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advantage arises from the quartet (qrt 1) mutation in Arabidopsis, which causes the four 
products of pollen mother cell meiosis in Arabidopsis to remain attached. When used to 
pollinate a flower, one tetrad can result in the formation of four seeds, and the plants from these 
seeds can be analyzed genetically. With unordered tetrads, however, such as those produced by 
5 Arabidopsis, genetic mapping using tetrad analysis requires that two markers be scored 
simultaneously. 

In another embodiment, the present invention provides new plant artificial 
chromosomes (PLACs) and DNA fragments for the creation thereof. In a preferred 
embodiment the PLAC will have functional sequences which include telomeres, a plant and / or 

10 other autonomous replicating sequence, a centromere, and selectable markers which confer a 
growth advantage under particular conditions to plant cells carrying the marker, thereby 
allowing identification of plants, plants cells or cells from any other organism of interest 
containing the PLAC. A selectable marker may in particular embodiments of the invention 
function in bacterial cells. The PLAC also may contain "negative" selectable markers which 

15 confer susceptibility to an antibiotic, herbicide or other agent, thereby allowing for selection 
against plants, plant cells or cells of any other organism of interest containing a PLAC. The 
PLAC also may include genes which control the copy number of the PLAC within a cell. One 
or more structural genes also may be included in the PLAC. Specifically contemplated as being 
useful will be as many structural genes as may be inserted into the PLAC while still 

20 maintaining a functional vector. This may include one, two, three, four, five, six, seven, eight, 
nine or more structural genes. 

In another embodiment, the invention provides methods for expressing foreign genes in 
plants, plant cells or cells of any other organism of interest. The foreign genes may be from 
any organism, including plants, animals and bacteria. It is further contemplated that PLACs 

25 could be used to simultaneously transfer multiple foreign genes to a plant comprising entire 
biochemical or regulatory pathways. In yet another embodiment of the invention, it is 
contemplated that the PLACs can be used as DNA cloning vectors. Such a vector could be 
used in plant and animal sequencing projects. The current invention may be of particular use in 
the cloning of sequences which are "unclonable" in yeast and bacteria, but which may be easier 

30 to clone in a plant based system. 
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about several base pairs to one hundred megabase pairs, including about 1 kb, 25 kB, 50 kB, 
100 kB, 125 kB, 150 kB, 200 kB, 300 kB, 400 kB, 500 kB, 600 kB, 700 kB, 800 kB, 900 kB, 1 
MB, 1.25 Mb, 1.5 Mb, 2 Mb, 3 Mb, 5 Mb, 10 Mb, 25 Mb, 50 Mb and 100 Mb. 

In still yet another embodiment, the present invention relates to the construction of 
5 artificial chromosome vectors for the genetic transformation of plant cells, processes for their 
preparation, uses of the vectors, and organisms transformed by them. Standard reference works 
setting forth the general principles of recombinant DNA technology include Lewin, 1985. 
Other works describe methods and products of genetic engineering; see, e.g., Maniatis et al, 
1982; Watson et al, 1983; Setlow et al, 1979; and Dillon et al, 1985. 

1 0 BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to 
further demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the detailed ... 
description of specific embodiments presented herein. 

15 FIG- 1 • Centromere mapping with unordered tetrads : A cross of two parents (AABB x 

aabb), in which "A" is on the centromere of one chromosome, and "B" is linked to the 
centromere of a second chromosome. At meiosis, the A and B chromosomes assort 
independently, resulting in equivalent numbers of parental ditype (PD) and nonparental ditype 
(NPD) tetrads (recombinant progeny are shown in gray). Tetratype tetrads (TT) result only 

20 from a crossover between "B" and the centromere. 

FIG - 2: Low resolution man location of A rabidopsis centromeres. Trisomic mapping 
was used to determine the map position of centromeres on four of the five Arabidopsis 
chromosomes (Koornneef 1983; Sears et al, 1970). For chromosome 4, useful trisomic strains 
were not obtained. With the methods of Koornneef and Sears et al, 1983. (which rely on low- 
25 resolution deletion mapping) the centromere on chromosome 1 was found to lie between the 
two visible markers, /// and chl, that are separated by 5 cM. Centromere positions on the other 
chromosomes are mapped to a lower resolution. 

FIG. 3A-E: Map location of Arabi dopsis thaliana centromeres on chromosomes 1-5 . 
Centromeres were mapped by tetrad analysis as described below. Genetic markers used to map 
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I-Ppo I, I-Ceu I, PI-Tli I, PI-Psp I, Not I, and PI See I. In FIG. 7A, FIG. 7B, FIG. 7C, FIG. 7D, 
FIG. 7E, FIG. 7G, and FIG. 7H the centromere is flanked by Lox sites which can act as targets 
for the site specific recombinase Cre. FIG. 7A: An E. Coli plant circular shuttle vector with a 
plant ARS. FIG. 7B: A plant circular shuttle vector without a plant ARS. The vector relies on 
5 a plant origin of replication function found in other plant DNA sequences such as selectable 
markers. FIG. 7C: A yeast-plant circular shuttle vector with a plant ARS. The yeast ARS is 
included twice, once on either side of the multiple cloning site to ensure that large inserts are 
stable. FIG. 7D: A yeast-plant circular shuttle vector without a plant ARS. The vector relies on 
a plant origin of replication function found in other plant DNA sequences such as selectable 

10 markers. The yeast ARS is included twice, once on either side of the multiple cloning site to 
ensure that large inserts are stable. FIG. 7E: An E. Coli -Agrobacterium - Plant circular shuttle 
vector with a plant ARS. Vir functions for T-DNA transfer would be provided in trans by using 
the appropriate Agrobactehum strain. FIG. 7F: An E, Coli - Agrobacterium - plant circular 
shuttle vector without a plant ARS. The vector relies on the plant origin of replication function 

15 found in other plant DNA sequences such as selectable markers. Vir functions for T-DNA 
transfer would be provided in trans by using the appropriate Agrobacterium strain. FIG. 7G: A 
linear plant vector with a plant ARS. The linear vector could be assembled in vitro and then 
transferred into the plant by, for example, mechanical means such as microprojectile 
bombardment electroporation, or PEG-mediated transformation. FIG. 7H: A linear plant 

20 vector without a plant ARS. The linear vector could be assembled in vitro and then transferred 
into the plant by, for example, mechanical means such microprojectile bombardment, 
electroporation, and PEG-mediated transformation. 

DETAILED DESCRIPTION OF THE INVENTION 

The prior art has failed to provide a centromere which is functional in plants. This 
25 failure is exemplified by the general lack of detailed information in the art regarding the 
centromeres of multicellular organisms in general. To date, the most extensive and reliable 
characterization of centromere sequences has come from studies of lower eukaryotes such as S. 
cerevisiae and S, pombe, where the ability to analyze centromere functions has provided a clear 
picture of the essential DNA sequences. The S. cerevisiae centromere consists of three 
30 essential regions, CDEI, CDEI1, and CDEIII, totaling only 125 bp, or approximately 0.006 to 
0.06% of each yeast chromosome (Carbon et aL, 1990; Bloom 1993). S. pombe centromeres 
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hybridization studies, however, have shown that this fragment is absent from naturally 
occurring centromeres, calling into question the reliability of this approach for testing 
centromere function (Tyler-Smith et al, 1993). Similarly, transfection of alphoid satellites into 
cell lines results in the formation of new chromosomes, yet these chromosomes also contain 
5 host sequences that could contribute centromere activity (Haaf et al, 1992; Willard, 1997). 
Further, the novel chromosomes can have alphoid DNA spread throughout their length yet have 
only a single centromeric constriction, indicating that a block of alphoid DNA alone may be 
insufficient for centromere function (Tyler-Smith et al , 1 993). 

Although plant centromeres can be visualized easily in condensed chromosomes, they 
10 have not been characterized as extensively as centromeres from yeast or mammals. Genetic 
characterization has relied on segregation analysis of chromosome fragments, and in particular 
on analysis of trisomic strains that carry a genetically marked, telocentric fragment (for 
example, see Koornneef 1983; FIG. 2). In addition, repetitive elements have been identified 
that are either genetically (Richards et al, 1991) or physically (Alfenito et al, 1993; 
15 Maluszynska et al, 1991) linked to a centromere. In no case, however, has the functional 
significance of these sequences been tested. 

Cytology in Arabidopsis thaliana has served to correlate centromere structure with 
repeat sequences. A fluorescent dye, DAPI, allows visualization of centromeric chromatin 
domains in metaphase chromosomes. A fluorescence in situ hybridization (FISH) probe based 

20 on 1 80 bp pALl repeat sequences colocalized with the DAPI signature near the centromeres of 
all five Arabidopsis chromosomes (Maluszynska et al, 1991; Martinez-Zapater et al, 1986), 
however this repeat probe also hybridizes within noncentromeric regions of the chromosomes. 
Although a functional role for pALl has been proposed, more recent studies have failed to 
detect this sequence near the centromeres in species closely related to Arabidopsis thaliana 

25 (Maluszynska et al, 1993). These results are particularly troubling because one of the species 
tested, A. pumila, is thought be an amphidiploid, derived from a cross between A. thaliana and 
another close relative (Maluszynska et al, 1991; Price et al, 1995). Another repetitive 
sequence, pAtT12, has been genetically mapped to within 5 cM of the centromere on 
chromosome 1 and to the central region of chromosome 5 (Richards et al, 1991), although its 

30 presence on other chromosomes has not been established. Like pALl, a role for pAtT12 in 
centromere function remains to be demonstrated. 
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Due to the fact that kinetochores constitute a necessary link between centromeric DNA 
and the spindle apparatus, the proteins that are associated with these structures recently have 
been the focus of intense investigation (B.ootn 1993; Eamshaw ,99!). Human autoantibodies 
that btnd specifically in the vicinity of the centromere have facilitated the cloning of 
> centmmere-associated proteins (CENPs, Rattner 1991), and a, leas, one of these proteins 
belongs to the kinesin superfamily of microtubule-based motors (Yen 1991) Yeas, 
centromere-binding proteins also have been identified, both through genetic and biochemical 
studies (Bloom 1993; Lechner « ,991). A , though me basic fcatures „ f ^ ^ ^ 
(mcloding spindle attechment, chromosome pairing, anj chromosome separation) are conserved 
among all eukaryotes, a comparison of the known centromere-binding proteins from yeas, and 
humans reveals little similarity (Bloom, 1993; Eamshaw and Cooke, 1989). 

The centromeres of ****** lhalUma have ^ ^ ^ ^ ^ 
where me segregation of chromosome fragnrents (Koomneef 1983) or whole chromosomes' 
(Sears « „t, ,970, was used to localize four of centmmeres to within 5, 12, 17 and 38 cM 
respectively (FIG. 2). These potions have not been refined by more recent studies because the 
method is limited, no, only by the difficulty in obtaining viable trisomic strains, but also by the 
maccuracy of comparing recombination frequencies in chromosome fragments ,o those in intac, 
chromosomes (Koomneef 1983). These factors introduce significant error into the calculated 
posttion of the centromere, and in ArabUhpsis, where I cM corresponds roughly to 200 kB 
(Koomneef ,987; Hwang e, al, ,991), this method did no, map any of the centromeres with 
sufficent precision to make chromosome walking strategies practical. 

Tetrad Analy sis 

With tetrad analysis, the recombination frequency between genetic markers and a 
centromere can be measured directly (F.G. 1). This method requires analysis of all four 
products of individual meiosis, and i, has no, been applied previously to multicellular 
eukaryotes because their meiotic products typically are dissociated. Identification of tire 
,uar,e< nutation makes tetrad analysis possible for the firs, time in a multicellular genetic 
model system (Preuss e, a,., ,994). The „uar,e, (qrt ,) mutation causes the four products of 
pollen mother ce,l meiosis in AraUJops* to remain atteched. When used ,„ poUinate a flower 
one tetiad can result in me formation „f four seeds, and the plants from these seeds can be' 
analyzed genetically. 
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With unordered tetrads, such as those produced by S. cerevisiae or Arabidopsis, genetic 
mapping using tetrad analysis requires that two markers be scored simultaneously (Whitehouse 
1950). Tetrads fall into different classes depending on whether the markers are in a parental 
(nonrecombinant) or nonparental (recombinant) configuration (FIG. 1). A tetrad with only 
5 nonrecombinant members is referred to as a parental ditype (PD); one with only recombinant 
members as a nonparental ditype (NPD); and a tetrad with two recombinant and two 
nonrecombinant members as a tetratype (TT) (Perkins 1953). If two genetic loci are on 
different chromosomes, and thus assort independently, the frequency of tetratype (crossover 
products) versus parental or nonparental assortment ditype (noncrossover products) depends on 
10 the frequency of crossover between each of the two loci and their respective centromeres. 

Tetratype tetrads arise only when a crossover has occurred between a marker in question 
and its centromere. Thus, to identify genes that are closely linked to the centromere, markers 
are examined in a pair-wise fashion until the TT frequency approaches zero. The genetic 
distance (in centimorgans, cM) between the markers and their respective centromeres is defined 
15 by the function [(1/2)TT]/100 (Mortimer et aL 9 1981). Because positional information obtained 
by tetrad analysis is a representation of physical distance between two points, as one 
approaches the centromere the chance of a recombination event declines. 

Tetrad analysis has been used to genetically track centromeres in yeasts and other fungi 
in which products of a single meioses can be collected. The budding yeast Saccharomyces 
20 cerevisiae lacks mitotic condensation and thus cytogenetics (Hegemann et aL, 1993), yet due to 
tetrad analysis, has served as the vehicle of discovery for centromere function. Meiosis is 
followed by the generation of four spores held within an ascus and these can be directly assayed 
for gene segregation. 

The recessive qrtl mutation makes it possible to perform tetrad analysis in Arabidopsis 
25 by causing the four products of meiosis to remain attached (Preuss et al, 1994; and Smythe 
1994; both incorporated herein by reference). As previously shown, within each tetrad, genetic 
loci segregate in a 2:2 ratio (FIG. 6). Individual tetrads can be manipulated onto flowers with a 
fine brush (at a rate of 20 tetrads per hour), and in 30% of such crosses, four viable seeds can be 
obtained (Preuss et ai y 1994). 
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Mapping centromeres with high precision requires a dense genetic map, and although 
the current Arabidopsis map contains many visible markers, it would be laborious to cross each 
into the art J background. Alternatively, hundreds of DNA polymorphisms can be introduced 
sxmultaneously by crossing two different strains, both containing the artl mutation. A dense 
5 RFLP map (Chang et al, 1988) and PCR-based maps (Konieczny et al, 1993; Bell et al 1994) 
have been generated in Arabidopsis from crosses of the Landsberg and Columbia strains 
(Arabidopsis map and genetic marker data is available from the internet at http://genome- 
www.stanford.edu/Arabidopsis and http://cbil.humgen.upenn.edu/atgc/sslp info/sslp.html) 
These strains differ by 1% at the DNA sequence level and have colinear genetic maps (Chang et 
10 al, 1988; Koomneef, 1987). 

Centromere mapping with tetrad analysis requires simultaneous analysis of two 
markers, one of which must be centromere-linked (FIG. 1). To identify these centromere- 
hnked markers, markers distributed across all 5 chromosomes were scored and compared in a 
pairwise fashion. 

Initially, genetic markers that can be scored by PGR analysis were tested (Konieczny et 
al, 1 993; Bell et al, 1994). Such markers are sufficiently dense to map any locus to ± 10 cM 
and as additional PCR-detectable polymorphisms are identified they are incorporated into the 
analyses. As higher resolution mapping becomes necessary, selected RFLPs, from the existing 
set of over 350 markers will be scored. In addition, as described in FIG. 5, new CAPS and 
SSLP markers useful for mapping the centromere can be readily identified. 



15 
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25 



II. Tetrad Sets 



To date, progeny plants from 388 isolated tetrad seed sets have been germinated and 
leaf t,ssue collected and stored from each of the tetrad progeny plants. The leaf tissue from 
md^dual plants was used to make DNA for PGR based marker analysis. The plants also were 
allowed to self-fertilize and the seed they produced was collected (a list of the seed stock of 
mformative individuals used for tetrad analysis is given in FIG. 4). From each of these 
mdtvidual seed sets, seedlings can be germinated and their tissues utilized for making genomic 
DNA. Tl ssue pooled from multiple seedlings is useful for making Southern genomic DNA 
blots for the analysis of restriction fragment length polymorphisms (RFLPs). Informative 
plants which have been used for tetrad analysis are given in FIG. 4. 
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III. Ma pping Strateg y 

The initial mapping of the centromeres was completed with selected PCR™ based 
markers (such as CAPS and SSLPs) (FIG. 5). These markers were selected from the large 
number of markers available for Arabidopsis and are distributed across the genetic map. Due to 
5 the fact that analysis of RFLP markers is labor intensive, those markers that could be analyzed 
with PCR™ -based markers were surveyed first. Only a subset of the original 388 tetrads have 
provided crossovers close enough to their centromeres as to require the use of RFLP marker 
segregation analysis. For those tetrads in which existing markers do not identify the region of 
crossover between centromere and marker, new markers may be developed. This may be 

10 accomplished by screening cloned sequences in regions of interest for new potential markers, 
i.e. 9 RFLPs, CAPS, SSLPs, and the like. The new markers may then be used to generate new 
data with the relevant tetrads. In addition to the markers in centromeric regions, the segregation 
of two markers per chromosome that are both, far from, and flank the centromeric regions were 
assessed. Data from these markers will often be tetratype and thus identify whether a 

15 Landsberg to Columbia crossover or a Columbia to Landsberg crossover is visible as the 
centromere is approached. 

Codominant cleaved amplified polymorphic sequences (CAPS) were amplified by 
PCR™. The PCR™ products were observed on agarose gels prior to cleavage by restriction 
enzymes. Once cleaved, the products were scored for marker segregation on agarose or 
20 acrylamide gels. Simple sequence length polymorphisms (SSLPs) also were amplified by 
PCR™. The PCR™ products were directly scored by polyacrylamide gel electrophoresis 
(PAGE), or on agarose gels. 

IV. High Resolution Positioning of Centromeres on the Genetic Map 

Southern genomic DNA blots in combination with RFLP analysis may be used to map 
25 centromeres with a high degree of resolution. The stored seedling tissue provides the necessary 
amount of DNA for analysis of the restriction fragments. Southern blots are hybridized to 
probes labeled by radioactive or non-radioactive methods. 

It may, in many cases, be desired to identify new polymorphic DNA markers which are 
closely linked to the target region. In some cases this can be readily done. For example when 
30 comparing Landsberg and Columbia DNA, a polymorphic Sau3A site can be found for about 
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every 8 to 20 kB surveyed. Subtractive methods are available for identifying such 
polymorphisms (Rosenberg et al., 1994), and these subtractions may be performed using DNA 
from selected, centromeric YAC clones. Screens for RFLP markers potentially linked to 
centromeres also can be performed using DNA fragments from a centromere-linked YAC clone 
to probe blots of Landsberg and Columbia genomic DNA that has been digested with a panel of 
restriction enzymes. 

V ' Isolation of Centromere Containing I)NA Frafl men* 

Using the markers flanking each centromere (see FIG. 3) it is possible to purify a 
contiguous DNA fragment that contains both flanking markers and the centromere encoded 
between those markers. In order to carry this out, very large DNA fragments up to the size of 
an entire chromosome are prepared by embedding Arabidopsis tissues in agarose using, for 
example, the method described by Copenhaver et al., (1995). These large pieces of DNA can 
be digested in the agarose with any restriction enzyme. Those restriction enzymes which will 
be particularly useful for isolating intact centromeres include enzymes which yield very large 
DNA fragments. Such restriction enzymes include those with specificities greater than six base 
pairs such as, for example, Asc I, Bae I, BbvC I, Fse I, Not I, Pac I, Pme I, PpuM I, Rsr II 
SanD I, Sap I, SexA I, Sfi I, Sgf I, SgrA I, Sbf I, Srf I, Sse8387 1, Sse8647 1, Swa, UbaD I, and 
UbaE I, or any other enzyme that cuts at a low frequency within the Arabidopsis genome, and 
specifically within the centromeric region. Alternatively, a partial digest with a more frequent 
20 cutting restriction enzyme could be used. 

The large DNA fragments produced by digestion with restriction enzymes are then 
separated by size using pulsed-field gel electrophoresis (PFGE) (Schwartz al, 1982). 
Specifically, Contour-clamped Homogeneous Electric Field (CHEF) electrophoresis (a variety 
of PFGE) can be used to separate DNA molecules as large as 10 Mb (Chu et al, 1985). Large 
DNA fragments resolved on CHEF gels can then be analyzed using standard Southern 
hybridization techniques to identify and measure the size of those fragments which contain both 
centromere flanking markers and therefor, the centromere. After determining the size of the 
centromere containing fragment by comparison with known size standards, the region from the 
gel that contains the centromere fragment can be cut out of a duplicate gel. This centromeric 
DNA can then be analyzed, sequenced, and used in a variety of applications, as described 
below, including the construction of plant artificial chromosomes (PLACs). As indicated in 



25 
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detail below, PLACs can be constructed by attaching telomeres and selectable markers to the 
centromere fragment cut from the agarose gel using standard techniques which allow DNA 
ligation within the gel slice. Plant cells can then be transformed with this hybrid DNA 
molecule using the techniques described herein below. 

5 VI. PLAC Constructs 

In light of the instant disclosure it will be possible for those of ordinary skill in the art to 
construct the artificial chromosomes described herein. Useful construction methods are well- 
known (see, for example, Maniatis et al, 1982). As constructed, the PLAC will preferably 
include at least an autonomous replication sequence (ARS) functional in plants, a centromere 
10 functional in plants, and telomeres which are functional in plants. 

In addition to the basic elements, positive and negative selectable plant markers (e.g., 
antibiotic or herbicide resistance genes), and a cloning site for insertion of foreign DNA will 
preferably be included. In addition, a visible marker, such as green fluorescent protein, also 
may be desirable. In order to propagate the vectors in E. coli, it is necessary to convert the 

15 linear molecule into a circle by addition of a stuffer fragment between the telomeres. Inclusion 
of an E. coli plasmid replication origin and selectable marker also may be preferred. It also 
may be desirable to include Agrobacterium sequences to improve replication and transfer to 
plant cells. Exemplary artificial chromosome constructs are given in FIGs. 7A-7H, although it 
will be apparent to those in skill art that many changes may be made in the order and types of 

20 elements present in these constructs and still obtain a functional artificial chromosome within 
the scope of the instant invention. 

Artificial plant chromosomes which replicate in yeast also may be constructed to take 
advantage of the large insert capacity and stability of repetitive DNA inserts afforded by this 
system (see Burke et al y 1987). In this case, yeast ARS and CEN sequences are added to the 
25 vector. The artificial chromosome is maintained in yeast as a circular molecule using a stuffer 
fragment to separate the telomeres. 

A fragment of DNA, from any source whatsoever, may be purified and inserted into an 
artificial plant chromosome at any appropriate restriction endonuclease cleavage site. The 
DNA segment usually will include various regulatory signals for the expression of proteins 
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encoded by the fragment. Alternatively, regulatory signals resident in the artificial 
chromosome may be utilized. 

The techniques and procedures required to accomplish insertion are well-known in the 
art (see Maniatis et ai, 1982). Typically, this is accomplished by incubating a circular plasmid 
or a linear DNA fragment in the presence of a restriction endonuclease such that the restriction 
endonuclease cleaves the DNA molecule. Endonucleases preferentially break the internal 
phosphodiester bonds of polynucleotide chains. They may be relatively unspecific, cutting 
polynucleotide bonds regardless of the surrounding nucleotide sequence. However, the 
endonucleases which cleave only a specific nucleotide sequence are called restriction enzymes. 
Restriction endonucleases generally internally cleave DNA molecules at specific recognition 
sites, making breaks within "recognition" sequences that in many, but not all. cases exhibit two- 
fold symmetry around a given point. Such enzymes typically create double-stranded breaks. 

Many of these enzymes make a staggered cleavage, yielding DNA fragments with 
protruding single-stranded 5' or 3' termini. Such ends are said to be "sticky" or "cohesive" 
because they will hydrogen bond to complementary 3' or 5' ends. As a result, the end of any 
DNA fragment produced by an enzyme, such as EcoRI, can anneal with any other fragment 
produced by that enzyme. This properly allows splicing of foreign genes into plasmids, for 
example. Some restriction endonucleases that may be particularly useful with the current 
invention include Hindlll, Pstl, EcoRl, and BamUL 

Some endonucleases create fragments that have blunt ends, that is. that lack any 
protruding single strands. An alternative way to create blunt ends is to use a restriction enzyme 
that leaves overhangs, but to fill in the overhangs with a polymerase, such as klenow, thereby 
resulting in blunt ends. When DNA has been cleaved with restriction enzymes that cut across 
both strands at the same position, blunt end ligation can be used to join the fragments directly 
together. The advantage of this technique is that any pair of ends may be joined together, 
irrespective of sequence. 

Those nucleases that preferentially break off terminal nucleotides are referred to as 
exonucleases. For example, small deletions can be produced in any DNA molecule by 
treatment with an exonuclease which starts from each 3' end of the DNA and chews away single 
strands in a 3' to 5' direction, creating a population of DNA molecules with single-stranded 
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fragments at each end, some containing terminal nucleotides. Similarly, exonucleases that 
digest DNA from the 5' end or enzymes that remove nucleotides from both strands have often 
been used. Some exonucleases which may be particularly useful in the present invention 
include 2ta/31, 51, and £xoIII. These nucleolytic reactions can be controlled by varying the 
5 time of incubation, the temperature, and the enzyme concentration needed to make deletions. 
Phosphatases and kinases also may be used to control which fragments have ends which can be 
joined. Examples of useful phosphatases include shrimp alkaline phosphatase and calf 
intestinal alkaline phosphatase. An example of a useful kinase is T4 polynucleotide kinase. 

Once the source DNA sequences and vector sequences have been cleaved and modified 
10 to generate appropriate ends they are incubated together with enzymes capable of mediating the 
ligation of the two DNA molecules. Particularly useful enzymes for this purpose include T4 
ligase, E. coli ligase, or other similar enzymes. The action of these enzymes results in the 
sealing of the linear DNA to produce a larger DNA molecule containing the desired fragment 
(see, for example, U.S. Pat. Nos. 4,237,224; 4,264,731; 4,273,875; 4,322,499 and 4,336,336, 
1 5 which are specifically incorporated herein by reference). 

It is to be understood that the termini of the linearized plasmid and the termini of the 
DNA fragment being inserted must be complementary or blunt in order for the ligation reaction 
to be successful. Suitable complementarity can be achieved by choosing appropriate restriction 
endonucleases {i.e., if the fragment is produced by the same restriction endonuclease or one that 

20 generates the same overhang as that used to linearize the plasmid, then the termini of both 
molecules will be complementary). As discussed previously, in a preferred embodiment, at 
least two classes of the vectors used in the present invention are adapted to receive the foreign 
oligonucleotide fragments in only one orientation. After joining the DNA segment to the 
vector, the resulting hybrid DNA can then be selected from among the large population of 

25 clones or libraries. 

A method useful for the molecular cloning of DNA sequences includes in vitro joining 
of DNA segments, fragmented from a source high molecular weight genomic DNA, to vector 
DNA molecules capable of independent replication. The cloning vector may include plasmid 
DNA (see Cohen et al, 1973), phage DNA (see Thomas et al. y 1974), SV40 DNA (see 
30 Nussbaum et ai, 1976), yeast DNA, E. coli DNA and most significantly, plant DNA. 
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A variety of processes are known which may be utilized to effect transformation; i.e., 
the inserting of a heterologous DNA sequences into a host cell, whereby the host becomes 
capable of efficient expression of the inserted sequences. 

VII. Definitions 

By "transformation" or "transfection" is meant the acquisition in cells of new DNA 
sequences through incorporation of added DNA. This is the process by which naked DNA, 
DNA coated with protein, or whole artificial chromosomes are introduced into a cell, resulting 
in a heritable change. 

By "gene" is meant a DNA sequence that contains information for construction of a 
polypeptide or protein, and includes 5' and 3' ends. This also includes genes which encode only 
RNA products such as tRNA or rRNA genes. 

As used herein, "eukaryote" refers to living organisms whose cells contain nuclei. A 
eukaryote may be distinguished from a "prokaryote" which is an organism which lacks nuclei. 
Prokaryotes and eukaryotes differ fundamentally in the way their genetic information is 
organized, as well as their patterns of RNA and protein synthesis. 

By the term "lower eukaryote" is meant a eukaryote characterized by a comparatively 
simple physiology and composition, and most often unicellularity. Examples of lower 
eukaryotes include flagellates, ciliates, and yeast. 

By contrast, the term "higher eukaryote" means a multicellular eukaryote, typically 
characterized by its greater complex physiological mechanisms and relatively large size. 
Generally, complex organisms such as plants and animals are included in this category. 
Preferred higher eukaryotes to be transformed by the present invention include, for example, 
monocot and dicot angiosperm species, gymnosperm species, fern species, plant tissue culture 
cells of these species, animal cells and algal cells. It will of course be understood that 
prokaryotes and eukaryotes alike may be transformed by the methods of this invention. 

As used herein, the term "plant" includes plant cells, plant protoplasts, plant calli, and 
the like, as well as whole plants regenerated therefrom. 

As used herein, "heterologous gene" or "foreign gene" is a structural gene that is 
foreign, i.e., originating from a donor different from the host or a chemically synthesized gene, 
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and can include a donor of a different species from the host. The heterologous gene codes for a 
polypeptide or RNA ordinarily not produced by the organism susceptible to transformation by 
the expression vehicle. Another type of "heterologous gene' 1 is an altered gene from the host 
itself, or an unaltered gene which is present in one or more extra copies. One example of such 
5 an altered gene useful in the present invention is a mutant gene which encodes a herbicide- 
resistant form of a normally occurring enzyme. 

By "host" is meant any organism that is the recipient of a replicable plasmid, or 
expression vector comprising an artificial chromosome. Ideally, host strains used for cloning 
experiments should be free of any restriction enzyme activity that might degrade the foreign 

10 DNA used. Preferred examples of host cells for cloning, useful in the present invention, are 
bacteria such as Escherichia coli, Bacillus subtilis, Pseudomonas, Streptomyces, Salmonella, 
and yeast cells such as 5. cerevisiae. Host cells which can be targeted for expression of an 
artificial chromosome may be plant cells of any source and specifically include Arabidopsis, 
maize, rice, sugarcane, sorghum, barley, soybeans, tobacco, wheat, tomato, potato, citrus, or 

1 5 any other agronomically or scientifically important species. 

By "expression" is meant the process by which a structural gene produces an RNA 
molecule, typically termed messenger RNA (mRNA). The mRNA is typically, but not always, 
translated into polypeptide(s). 

By "linker" it is meant a DNA molecule, generally up to 50 or 60 nucleotides long and 
20 synthesized chemically, or cloned from other vectors. In a preferred embodiment, this fragment 
contains one, or preferably more than one, restriction enzyme site for a blunt-cutting enzyme 
and a staggered-cutting enzyme, such as BamHL One end of the linker fragment is adapted to 
be ligatable to one end of the linear molecule and the other end is adapted to be ligatable to the 
other end of the linear molecule. 

25 As used herein, a "library" is a pool of random DNA fragments which are cloned. In 

principle, any gene can be isolated by screening the library with a specific hybridization probe 
(see, for example, Young et ah, 1977). Each library may contain the DNA of a given organism 
inserted as discrete restriction enzyme-generated fragments or as randomly sheered fragments 
into many thousands of plasmid vectors. For purposes of the present invention, E. coli, yeast, 
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and Salmonella plasmids are particularly useful wh Pn th„ 

,y uselul when the genome inserts come from other 

organisms. 

By "hybridization" is mean, the pairing of complement RNA and DNA strands ,„ 
produce an RNA-DNA hybrid, or ahenaadve.y, U,= pairing of rivo DNA singie stands ta 
generally dtfferen, or ft. same sources * produce a double snanded DNA moiecule. 

The tern, -plasmid" or "cloning vector" as used herein refers to a Cosed covalendy 
cucuiar cx^chrcnoaoma, DNA or iinear DNA which is able ,„ autonomously rep,ica te in a 
hos, eel, and which is normally nonessenda, to me survival of the cell. A wide variety of 

10 fTt 7T' ^ *" k "° Wn "* ~' y ^ " ** M " Cohen 
■0 e a, U.S. Pa, No. which discloses exantples of DNA plasmids, and which is 

specifically incorporated herein by reference). 

As used herein, a "probe" is any biochemical reagen, (usuai.y ragged in some way for 
ease of .denhficarion), used to iden*y or iso,a,e a gene, a gene produo, a DNA segn.nL a 
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By "PLAC" it is meant a plant artificial chromosome of the current invention, as 
specifically disclosed herein. 

A "selecable marker" is a gene whose presence resuhs in a dear phenorype, and mos, 
onen a growth advantage for cells that contain the marice, This growth advantage may be 
present under srandard conditions, altered conditio, such as eievated ,e mp e ra ,ure, or in the 
„ o cerratn chemicaia such as herbicides or anUbiodcs. Use of se,ec,ab,e markers is 
deacrtbed, for example, in Broach e, al . (1979 , Exaraples of „ ^ ^ 
d^tdtne ktnase gene, the cellular adenine-phosphoriboayWeraae gene and the 
dthydry,fo,a,e reductase gene, hygromycin phosphobansferase genes, ,he bar gene and 
neomycm phosphoriansferase genes, among others. Prefer^ selectoble markers in the presen, 
- enuon mclude genes whose expression confer Abiotic or herbicide resist ,„ the boat 
ceii, suffice* to enabie the maintenance of a vector within the hoar cell, and which facibta* 
■he mampulahon of the plasmid into new hos, cells. Of particular inters i„ the present 
■nventton „ proteills confmi „ g ^ ^ ^ ^ 

tetracychne, G-41 8, bialaphos, and glvphosate for example. 
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By "PLAC-encoded protein" it is meant a polypeptide which is encoded by a sequence 
of a PLAC of the current invention. This includes those proteins encoded by functional 
sequences of the PLAC, such as selectable markers, telomeres, etc., as well as those proteins 
encoded by heterologous genes of the PLAC. 

5 By "PLAC-associated protein" it is meant a protein encoded by a sequence of the PLAC 

or a protein which is encoded by host DNA and binds with relatively high affinity to the 
centromeres of the current invention. 

VIII. Arabidopsis thaliana Centromeric DNA Segments as Hybridization Probes and 
Primers 

10 In addition to their use in the construction of PLACs, the centromeric regions disclosed 

herein also have a variety of other uses. For example, they also have utility as probes or 
primers in nucleic acid hybridization embodiments. As such, it is contemplated that nucleic 
acid segments that comprise a sequence region that consists of at least a 14 nucleotide long 
contiguous sequence that has the same sequence as, or is complementary to, a 14 nucleotide 

15 long contiguous DNA segment of a centromere of the current invention will find particular 
utility. Longer contiguous identical or complementary sequences, e.g., those of about 20, 30, 
40, 50, 100, 200, 500, 1000, 2000, 5000 bp, etc. including all intermediate lengths and up to 
and including the full-length sequence of a centromere of the current invention also will be of 
use in certain embodiments. 

20 The ability of such nucleic acid probes to specifically hybridize to centromeric 

sequences will enable them to be of use in detecting the presence of similar, partially 
complementary sequences from other plants or animals. However, other uses are envisioned, 
including the use of the centromeres for the preparation of mutant species primers, or primers 
for use in preparing other genetic constructions. 

25 Nucleic acid molecules having sequence regions consisting of contiguous nucleotide 

stretches of 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 
85, 90, 95, 100 or even of 101-200 nucleotides or so, identical or complementary to DNA 
sequences of a centromere of the current invention, are particularly contemplated as 

30 hybridization probes for use in, e.g., Southern and Northern blotting. Smaller fragments will 
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genera,,, fl„d use „ hybrjdi2atio „ ^ ^ ^ ^ ^ ^ 
compiememary region may be varied, such as between abou. ,0-M and abou, ,00 or 200 

length complementary sequences one wishes !o detect. 
* Of conrse, fiagm ents may als0 be ob|ained fcy ^ ^ 

mechamca, shearing or by Morion enzyme diction. Smal, nuCeic acid segmems o 
may be readUy prepay by, for exampie, direcUy synthesizing me fragment by 
*~* means, as is common,, practiced using « aut „ma,ed CigonuCeotide synmesizer 
A.SO, flagmen, may be omained by appbeation of nuCeic acid reproduce tecbno.ogy such 

-»* by rn.rod.ing se,ec,ed sequences into recombinan, vectors for recombhtan 
pKK.uchon, and by other recombinan, D NA techniques general,y known to those of ski „ i„I 
art of molecular biology. 

According,,, me centromeres of the ctnren, invention may be used for meir abUhy to 

15 selectively form duplex molecules with „ . 

n „ „. p ex molecules with complementary stretches of DNA fragments 

Dependmg on the application envisioned, one will desire to emnln • 

hvhriHi. . emp,oy var y in 8 conditions of 

hybndtzauon ,o achteve varying degrees of se.ecivhy of probe towards .arge, sequence For 

apphcauona requiring high sCecdvity, one wi„ ty p ica „ y ^ l0 emp|oy 

cond nous to form the hybrids, e.,, one wi„ seiec, re,aflve,y ,ow sah and/or high .empera are 

.0 - - Provided by about 0.0 2 M to abou, 0,5 M NaC, a, .empeLes of abl 

0 c .0 about 70'C. Such se,ec,ive conditions .Cerate ,i„,e, if any, misma.cn be W ee„ mc 

2ml 17 M ~* - W0UU " « — - <— 

Z7L r™* Detec,ion of DNA ~ via hm °» - ' 

*» . me an, and .he things of U. S. Pa.en. 4,965,138 and 5 ,,7699 5 (ea ch 

*ec, ca y ■»« herein by reference in i b en.ire W are exempt of me nJcTf 
ybnd,za,,on ana lyS es. Teachings such as tnose found in me texts of Mamy e, „, ,99, ■ Sega, 
.976 ; Pro k „p 1 99 1 ;andKuby„94,are P ar.icu 1 ar,y re ,evan,. * '»'•** 

Of course, for some appneauons, for examp.e, where one desires ,o prepare mu^ 
enrp oyme . mutm plfaer ^ ^ ^ _ ^ P 

centre mncion-encoding sequences flom re,a,ed species, flmcdon, equivaien., or 
»e .Ore, ,ess sflmgen, h ybridiza,io„ conditions wiU .ypicaj.y be needed in order .o a„ „ 
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formation of the heteroduplex. In these circumstances, one may desire to employ conditions 
such as about 0. 1 5 M to about 0.9 M salt, at temperatures ranging from about 20°C to about 
55°C. Cross-hybridizing species can thereby be readily identified as positively hybridizing 
signals with respect to control hybridizations. In any case, it is generally appreciated that 
5 conditions can be rendered more stringent by the addition of increasing amounts of formamide, 
which serves to destabilize the hybrid duplex in the same manner as increased temperature or 
decreased salt. Thus, hybridization conditions can be readily manipulated, and thus will 
generally be a method of choice depending on the desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences of the 
10 present invention in combination with an appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator means are known in the art, including 
fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of 
giving a detectable signal. In preferred embodiments, one will likely desire to employ a 
fluorescent label or an enzyme tag, such as urease, alkaline phosphatase or peroxidase, instead 
15 of radioactive or other environmentally undesirable reagents. In the case of enzyme tags, 
colorimetric indicator substrates are known that can be employed to provide a means visible to 
the human eye or spectrophotometrically, to identify specific hybridization with complementary 
nucleic acid-containing samples. 

In general, it is envisioned that the hybridization probes described herein will be usefiil 
20 both as reagents in solution hybridization as well as in embodiments employing a solid phase. 
In embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise 
affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then 
subjected to specific hybridization with selected probes under desired conditions. The selected 
conditions will depend on the particular circumstances based on the particular criteria required 
25 (depending, for example, on the G+C content, type of target nucleic acid, source of nucleic 
acid, size of hybridization probe, etc.). Following washing of the hybridized surface so as to 
remove nonspecifically bound probe molecules, specific hybridization is detected, or even 
quantitated, by means of the label. 
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IX ' PLAC -Associ a red Protein Sp rrifir Antibody Condition, an H m^--^ 
Making 

In particular embodiment, the inventors contemphte me „ of 
monoc 0M , or po,yc,o„al which hind to PLAOassociated proteins „ f me cumnl inveMio „. 
Such iPLAC-associafcd proteins include pro,e.ns which ace coded h, the s_ of the 
PLAC, as we,, as proteins encoded hy host DNA tha, hind to Ute centromeres of the current 
.nvennon. „ is speciflcallv comemplated ^ ^ ^^.^ 

antthodtes won,d a„ow for the further iso,a,ion and enaction of the PLAC-associated 
protetna For examp,e, proteins may he isolated which are encoded by the centromeres. 
Recombtnan, production of such proteins provi des . Murce of for 
antibodies. * 

Altema,ive,y, the cemromere may he used as a hgand to isolate, using affinity methods 
cenuomere binding proteins. Once iso,ated, these protein can he used as antigens for the 
productton po,yc,ona, and monoclonal antibodies. A variation on mis ,ecrau q ue has been 
d_,ed by Ratmer („,,), by Coning of centrotnere-associated proreins through the use 
of anttbodtes which bind in the vicinity of the centromere. 

Means for preparing and characterizing antibodies are well known in the w (see , . 
Antthodies: A Laboratory Manual, Cold Spring Harbor Laboratory, ,988; incorporated herein' 
by reference). The methods for gc „ erating monoCona, antibodies (mAbs) generaUy begin 
along the same lines as those for prying polyCona, antibodies. Briefly, a polyclonal 
anttbody ,s prepared by immunizing an amma, with an immunogenic composition in 
accordance with the present invention and collecting andsera from tha, immunized animal A 
wtde range of aruma, species can be used for the production of antisera. Typically Ute animal 
used for production of antisera is a rabbit, a mouse, a rat, a hamster, a guinea pig or a goat A 
abbu ,s a prefer choice for production of p„,yc,„„, antibodies because of the ease of 
handling, maintenance and relatively large blood volume. 

As is well known in Ute art, a given composition may vary in its immunogenic^. „ is 
often necessary therefore to boos, me host immune sysfcm, as may be achieved by coupling a 
peptide o, polypeptide immunogen to a carrier. Exempt and preferred carriers are keyhole 
lunpe, hemocyanin (KLH) and bovine serum albumin (BSA). Otiter albumins such as 
ovalbumtn, mouse serum albumin or rabbi, serum albumin also can be used as carrier, Means 
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for conjugating a polypeptide to a carrier protein are well known in the art and include 
glutaraldehyde, m-maleimidobencoyl-N-hydroxysuccinimide ester, carbodimide and bis- 
biazotized benzidine. 

As is also well known in the art, the immunogenicity of a particular immunogen 
5 composition can be enhanced by the use of non-specific stimulators of the immune response, 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant (a 
non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis), 
incomplete Freund's adjuvants and aluminum hydroxide adjuvant. 

The amount of immunogen composition used in the production of polyclonal antibodies 
10 varies upon the nature of the immunogen as well as the animal used for immunization. A 
variety of routes can be used to administer the immunogen (subcutaneous, intramuscular, 
intradermal, intravenous and intraperitoneal). The production of polyclonal antibodies may be 
monitored by sampling blood of the immunized animal at various points following 
immunization. A second, booster, injection also may be given. The process of boosting and 
15 titering is repeated until a suitable titer is achieved. When a desired level of immunogenicity is 
obtained, the immunized animal can be bled and the serum isolated and stored, and/or the 
animal can be used to generate mAbs. 

Monoclonal antibodies may be readily prepared through use of well-known techniques, 
such as those exemplified in U. S. Patent 4,196,265, incorporated herein by reference. 

20 Typically, this technique involves immunizing a suitable animal with a selected immunogen 
composition, e.g., a purified or partially purified PLAC-associated protein, polypeptide or 
peptide. The immunizing composition is administered in a manner effective to stimulate 
antibody producing cells. Rodents such as mice and rats are preferred animals, however, the 
use of rabbit, sheep, or frog cells also is possible. The use of rats may provide certain 

25 advantages (Goding 1986), but mice are preferred, with the BALB/c mouse being most 
preferred as this is most routinely used and generally gives a higher percentage of stable 
fusions. 

Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the mAb generating protocol. 
30 These cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a peripheral 
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blood sample. Spleen cel.s and peripheral Mood ce„s a, prefer^ the fotnter because they a. 

a nca source of anubooy-producing cells ^ „ fa ^ ^ 

latter because peripheral blood is ^ ^ a ^ rf ^ ^ 

— n. and the spte en of animal ^ to ^ ^ ^ ^ fe 

spleen lymphocytes obtained by homogenizing ta spleen wjlh . ^ a 

from an tmmuntzed mouse contains approximately 5 x 10' ,„ 2 * ,„' lymphocytes. 

The antibody-producing B lymphocytes fern the immunized animal are then fused with 

— . Myeloma ca„ lines suited for use in hybridoma-pmducmg (Usion pi0Kdurcs 
W are „o,an tib ody-p ro d„ci„ g , have high fitsion efficiency, and en^te deficiencies 

oT 7T T mbk ° f grOWi " 8 ta COTain ^ * growth of 

only the desired fiisad cells (hybridomas). 

7 °" e ° f a of myeloma cells may be used, as a, kno™ t0 ^ of sld „ fa 

* ar, (Godmg 1 0 86 ; Campbell 1M4 , For example , where ^ ^ 

MFCU-X45-GTG 1.7 and S194/5YYA r..i. f~ 

OAX0 Bul ' for rats > one may use R210.RCY3 Y3-Ae 1 2 1 
™3P and 4B2 ,0 ; and U- 266 , CM, S 00-O R C 2 , Oa WHMy2 and UC^a^ 
useful in connection with human cell fusions. 

One preferred murine myeloma ca« is me NS-1 myeloma eel! line (also tanned P3-NS- 
1-A84-!,, whtch » re adi,y available ftom the NIOMS Human Generic Mutan, Cell Repository 
by remreshng cell line tapository number QM3573. Another mouse myeloma cel, fine ma, ma^ 
be used ,s me 8-azaguanine-resistan, mouse murine myeloma SP2/0 non-producer cell line. 

Methods for genemung hybrids of anfibo^ucmg S p,ee„ or lymph node cells and 
m el ma cells usually contprise mhring sotnatie cells with myeloma ceUs in a 2: , ratio, though 
me rarto may vary ftom about 20:1 ,o about „ fc ^ 

a^s chemtca, or e.ecurical, that promote the fttsion of cel, membranes. Pusion methods 
usntg Sendat vtrus have been described (Kohler a, , 975; I976) , md ^ usklg 
Po.yemy.ene glycol (PEG), such as 37% ,v/v) PEG, (Oefter « «,, l977) ^ ^ 0 
e.ecncally induced fiusion methods also is apptopriate (Ooding 1986, 
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Fusion procedures usually produce viable hybrids at low frequencies, about 1 x 10" 6 to 
1 x 1 0" 8 . However, this does not pose a problem, as the viable, fused hybrids are differentiated 
from the parental, unfused cells (particularly the unfused myeloma cells that would normally 
continue to divide indefinitely) by culturing in a selective medium. The selective medium is 
5 generally one that contains an agent that blocks the de novo synthesis of nucleotides in the 
tissue culture media. Exemplary and preferred agents are aminopterin, methotrexate, and 
azaserine. Aminopterin and methotrexate block de novo synthesis of both purines and 
pyrimidines, whereas azaserine blocks only purine synthesis. Where aminopterin or 
methotrexate is used, the media is supplemented with hypoxanthine and thymidine as a source 
10 of nucleotides (HAT medium). Where azaserine is used, the media is supplemented with 
hypoxanthine. 

The preferred selection medium is HAT. Only cells capable of operating nucleotide 
salvage pathways are able to survive in HAT medium. The myeloma cells are defective in key 
enzymes of the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), and 
15 they cannot survive. The B-cells can operate this pathway, but they have a limited life span in 
culture and generally die within about two weeks. Therefore, the only ceils that can survive in 
the selective media are those hybrids formed from myeloma and B-cells. 

This culturing provides a population of hybridomas from which specific hybridomas are 
selected. Typically, selection of hybridomas is performed by culturing the cells by single-clone 
20 dilution in microtiter plates, followed by testing the individual clonal supernatants (after about 
two to three weeks) for the desired reactivity. The assay should be sensitive, simple and rapid, 
such as radioimmunoassays, enzyme immunoassays, cytotoxicity assays, plaque assays, dot 
immunobinding assays, and the like. 

The selected hybridomas would then be serially diluted and cloned into individual 
25 antibody-producing cell lines, which clones can then be propagated indefinitely to provide 
mAbs. The cell lines may be exploited for mAb production in two basic ways. A sample of the 
hybridoma can be injected (often into the peritoneal cavity) into a histocompatible animal of the 
type that was used to provide the somatic and myeloma cells for the original fusion. The 
injected animal develops tumors secreting the specific monoclonal antibody produced by the 
30 fused cell hybrid. The body fluids of the animal, such as serum or ascites fluid, can then be 
tapped to provide mAbs in high concentration. The individual cell lines also could be cultured 
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*n** where U,e ^ a. ^ ^ ^ ^ ^ ^ 

i„ hi g h c _ Uon , ^ produMd by ejihw J - * 

X - ELISAs and Immunnpr^ip^^^n,, 

- EUS^ZT ^ C °" jUnC, ' 0n * ^ » > 

pep.,es PLAC . encoded prolei „ ^ ^ ^ JV P ™- 
surface, a OTfece exhibi|i „ g . ^ affinjty ^ ^ ^ 

— p «e. Aft e, wsWng . remove ^ ~ 
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of ^ po„ d e, This a,,™ for blocking of „ fic 
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1 5 surface 
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enzyme that will generate color or light development upon incubating with an appropriate 
chromogenic substrate. Thus, for example, one will desire to contact and incubate the antisera- 
bound surface with a urease or peroxidase-conjugated anti-human IgG for a period of time and 
under conditions which favor the development of immunocomplex formation (e.g., incubation 
5 for 2 hours at room temperature in a PBS-containing solution). 

After incubation with the second enzyme-tagged antibody, and subsequent to washing 
to remove unbound material, the amount of label is quantified by incubation with a 
chromogenic substrate such as urea and bromocresol purple or 2,2'-azino-di-(3-ethyl- 
benzthiazoline)-6-sulfonic acid (ABTS) and H 2 0 2 , in the case of peroxidase as the enzyme 
10 label. Quantitation is then achieved by measuring the degree of color generation, e.g., using a 
visible spectra spectrophotometer. 

XI. Western Blots 

The compositions of the present invention may find use in immunoblot or western blot 

analysis. The antibodies of the invention may be used as high-affinity primary reagents for the 
1 5 identification of proteins immobilized onto a solid support matrix, such as nitrocellulose, nylon 

or combinations thereof. In conjunction with immunoprecipitation, followed by gel 

electrophoresis, these may be used as a single step reagent for use in detecting antigens against 

which secondary reagents used in the detection of the antigen cause an adverse background. 

This is especially useful when the antigens studied are immunoglobulins (precluding the use of 
20 immunoglobulins binding bacterial cell wall components), the antigens studied cross-react with 

the detecting agent, or they migrate at the same relative molecular weight as a cross-reacting 

signal. 

Immunologically-based detection methods for use in conjunction with Western blotting 
include enzymatically-, radiolabel-, or fluorescently-tagged secondary antibodies against the 
25 protein moiety are considered to be of particular use in this regard. 
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XII. DNA Seff mPnfc 

Further aspect of tire present invention concern isolated DNA segment and 
reeombinan, vectors encoding a fictional ArabUopsis Miana centiomere and other 
sequences for the creation and use of recombinant sequences of a PLAC. 

The present invention concerns DNA segments, isolatable from A. lhaUana colls, that 
are ennched relative to total genomic DNA and are capable of confemng centromere activity to 
a recombman. mo.ecule when incorporated into tire host cell. As used herein, the term 
centromere activity indicates the ability ,„ confer stable inheritance upon . DNA segment 
artificial chromosome or chromosome. 

As used herein, the term "DNA segment" refers to a DNA mo.ecule tha, has been 
punfied from total genomic DNA of a particular species. Therefore, a DNA segment encoding 
centromere function refers to a DNA segment that contain* centiomere coding sequences ye, is 
■solated away from, or purified free from, fond genomic DNA of A. Miana. fncluded within 
the rem, "DNA segment", are DNA segments and smafler fragments of such segments, and a!so 
recombinant vectors, including, for examp.e, plasmids, cosmids, phage, viruses, and the like. 

Simihriy, a DNA segment comprising an isolated or purified cemromeric sequence 
refers to a DNA segment including centiomere coding sequences and, in certain aspect* 
regulatory sequences, iso,a,ed substantial* away from other naturally occuning genes, protein 
encoding sequences, or outer DNA sequences. .„ tins respect, the rem, -gen," is used for 
stmphcity «„ refer t0 . toctional DNA segment, protein, polypeptide or peptide encoding unit 
As wil, be understood by fhose in fhe art, this tactiona. ferm includes bod, genomic sequences' 
cDNA sequences and smaller engineered gene segments tha, may express, or may be adapted to' 
express, proteins, polypeptides or peptides. 

"Isolated substantially away from ofher coding sequences" means that the sequences of 
■uteres,, m tins case centiomere faction encoding sequences, are included within the genomic 
DNA clones provided herein. Of course, tins refers ,„ tire DNA segmen, as originally isolafcd 
and does no, exclnde genes or c^ing regions lMer added „ , he ^ fcy fc ^ rf ^ 

In particular embodiments, tire invention concerns isolated DNA segments and 
reeombinan. vectors incorporating DNA sequences tha, encode a centromere functional 
sequence ma, includes a contiguous sequence from fhe cenfromeres of tire cunen, invention In 
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certain other embodiments, the invention concerns isolated DNA segments and recombinant 
vectors that include within their sequence a contiguous nucleic acid sequence from an A. 
thaliana centromere. Again, DNA segments that exhibit centromere function activity will be 
most preferred. 

5 The nucleic acid segments of the present invention, regardless of the length of the 

sequence itself, may be combined with other DNA sequences, such as promoters, 
polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length may vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length may be employed, with the total 
10 length preferably being limited by the ease of preparation and use in the intended recombinant 
DNA protocol. 

XIII. Biological Functional Equivalents 

Modification and changes may be made in the centromeric DNA segments of the 
current invention and still obtain a functional molecule with desirable characteristics. The 
15 following is a discussion based upon changing the nucleic acids of a centromere to create an 
equivalent, or even an improved, second-generation molecule. 

In particular embodiments of the invention, mutated centromeric sequences are 
contemplated to be useful for increasing the utility of the centromere. It is specifically 
contemplated that the function of the centromeres of the current invention may be based upon 

20 the secondary structure of the DNA sequences of the centromere and / or the proteins which 
interact with the centromere. By changing the DNA sequence of the centromere, one may alter 
the affinity of one or more centromere-associated protein(s) for the centromere and / or the 
secondary structure of the centromeric sequences, thereby changing the activity of the 
centromere. Alternatively, changes may be made in the centromeres of the invention which do 

25 not effect the activity of the centromere. Changes in the centromeric sequences which reduce 
the size of the DNA segment needed to confer centromere activity are contemplated to be 
particularly useful in the current invention, as would changes which increased the fidelity with 
which the centromere was transmitted during mitosis and meiosis. 
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MV ' ° btainin *^ ^^CentrommrDN^ from V^t ^ BacterialClones 

The Arabidopsis physical map consists primarily of YAC clones between 200 kb and 1 
Mb m length, and an overlapping contig map with several anchors to the genetic map is 
avauable (Hwang et ai, 1991) (http://cbil.humgen.upenn.edu/atgc/ATGCUPhtml) To be 
■ certam that an entire centromeric region has been cloned, clones or a senes of clones, are 
■dentmed that hybridize to markers on either side of each centromere. These efforts can be 
complicated by the presence of repetitive DNA in the centromere, as well as by the potential 
mstability of centromere clone, Thus, identification of large YACs with unique sequences that 
wUl serve as useful probes simplifies a chromosome walking strategy. Use of BACs may also 
be advantageous, as data has suggested that YAC clones may sometimes not span centromeres 
(Willard, 1997). 

Blot hybridization allows comparison of the structure of the clones with that of genomic 
DNA, and thus determines whether the clones have suffered deletion* or managements. The 
centromeric clones identified are useful for hybridization experiments that can be used to 
determine whether they share common sequences, whether drey localize ,n sUu «„ the 
cytologicafiy defined centiomeric region, and wherher they contain repetitive sequences 
though, to map near ArabUopsis centromeres (Richards e, a,., 1991; Maluszynska e, at. ,991). 

In a positional cloning approach, one may wish to begin sequencing if the region of 
■merest has been narrowed ,„ a sufficiently small area. The determination of what constitutes a 
sufflciem.y small sequence is dependent upon many factors inc.uding, bu, no, necessarily 
Itmtted ,„, me repetitive DNA comen, of ,he region of i„,eres,, Ute size of me region to be 
sequenced, the abiUty ,„ ob«ain higher resolution genetic mapping data, and tire relative 
efflcrency of sequencing technology. ,„ many cases> ^ y ^ ^ ^ ^ 

regton has been narrowed ,„ 40 kB or less. I, is es«ima,ed ,ha, in ArabUopsis .Haliana 1 cM 
corresponds ,oapproxima«ely 200 kB (Hwang, 1991; Koomneef, 1987). Such a resolution will 
on average, be provided by analysis of abou, ,00 tetrads for each cemromere, altitough some 
cenuomeres may be mapped with higher resolution due ,„ dte fac, that each ,«rad is 
stmultaneously mapping five centtomeres, and thus has five times «he usual probabili* of 
findtngacressoververy close toa locus of interest. Nonetheless, because the amount of DNA 
pet CM can increase in the vicinity of the cenfromere (Crupenter e, ai, .982), examination of 
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additional tetrad progeny and identification of additional genetic markers may be required to 
achieve the desired degree of genetic linkage. 

XV. Transformed Host Cells and Transgenic Plants 

Methods and compositions for transforming a bacterium, a yeast cell, a plant cell, or an 
5 entire plant with one or more artificial chromosomes are further aspects of this disclosure. A 
transgenic bacterium, yeast cell, plant cell or plant derived from such a transformation process 
or the progeny and seeds from such a transgenic plant also are further embodiments of the 
invention. 

Means for transforming bacteria and yeast cells are well known in the art. Typically, 
10 means of transformation are similar to those well known means used to transform other bacteria 
or yeast such as E. coli or Saccharomyces cerevisiae. Methods for DNA transformation of 
plant cells include Agrobacterium-mediated plant transformation, protoplast transformation, 
gene transfer into pollen, injection into reproductive organs, injection into immature embryos 
and particle bombardment. Each of these methods has distinct advantages and disadvantages. 
15 Thus, one particular method of introducing genes into a particular plant strain may not 
necessarily be the most effective for another plant strain, but it is well known in the art which 
methods are useful for a particular plant strain. 

There are many methods for introducing transforming DNA segments into cells, but not 
all are suitable for delivering DNA to plant cells. Suitable methods are believed to include 

20 virtually any method by which DNA can be introduced into a cell, such as by Agrobacterium 
infection, direct delivery of DNA such as, for example, by PEG-mediated transformation of 
protoplasts (Omirulleh et al t 1993), by desiccation/inhibition-mediated DNA uptake, by 
electroporation, by agitation with silicon carbide fibers, by acceleration of DNA coated 
particles, etc. In certain embodiments, acceleration methods are preferred and include, for 

25 example, microprojectile bombardment and the like. 

Technology for introduction of DNA into cells is well-known to those of skill in the art. 
Four general methods for delivering a gene into cells have been described: (1) chemical 
methods (Graham et al. 9 1973; Zatloukal et al, 1992); (2) physical methods such as 
microinjection (Capecchi, 1980), electroporation (Wong et ai 9 1982; Fromm et ai 9 1985; U. S. 
30 Patent No. 5,384,253) and the gene gun (Johnston et al, 1994; Fynan et aL, 1993); (3) viral 
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vectors (Ciapp 1993; Lu et al, 1993; Eglitis et al, 1988a; 1988b); and (4) receptor-mediated 
mechanisms (Curiel et al, 1991; 1992; Wagner et al, 1992). 

(i) Electroporation 

The application of brief, high-voltage electric pulses to a variety of animal and plant 
ceils leads to the formation of nanometer-sized pores in the plasma membrane. DNA is taken 
d,rectly into the cell cytoplasm either through these pores or as a consequence of the 
redistribution of membrane components that accompanies closure of the pores. Electroporation 
can be extremely efficient and can be used both for transient expression of cloned genes and for 
establishment of cell lines that carry integrated copies of the gene of interest. Electroporation 
m contrast to calcium phosphate-mediated transfection and protoplast fusion, frequendy gives 
rise to cell lines that carry one, or at most a few, integrated copies of the foreign DNA. 

The introduction of DNA by means of electroporation, is well-known to those of skill in 
the art. In this method, certain cell wall-degrading enzymes, such as pectin-degrading enzymes, 
are employed to render the target recipient cells more susceptible to transformation by 
electroporation than untreated cells. Alternatively, recipient cells are made more susceptible to 
transformation, by mechanical wounding. To effect transformation by electroporation one may 
employ either friable tissues such as a suspension culture of cells, or endogenic callus or 
alternatively, one may transform immature embryos or other organized tissues directly One 
would partially degrade the cell walls of the chosen cells by exposing them to pectin-degrading 
enzymes (pectolyases) or mechanically wounding in a controlled manner. Such cells would 
then be recipient to DNA transfer by electroporation, which may be carried out at this stage 
and transformed cells then identified by a suitable selection or screening protocol dependent on 
the nature of the newly incorporated DNA. 

(ii) Microprojectile Bombardment 

A further advantageous method for delivering transforming DNA segments to plant 
cells ,s microprojectile bombardment. In this method, particles may be coated with nucleic 
actds and delivered into cells by a propelling force. Exemplary particles include those 
comprised of tungsten, gold, platinum, and the like. 

An advantage of microprojectile bombardment, in addition to it being an. effective 
means of reproducibly stably transforming monocots, is that neither the isolation of protoplasts 
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(Cristou et a/., 1988) nor the susceptibility to Agrobacterium infection is required. An 
illustrative embodiment of a method for delivering DNA into maize cells by acceleration is a 
Biolistics Particle Delivery System, which can be used to propel particles coated with DNA or 
cells through a screen, such as a stainless steel or Nytex screen, onto a filter surface covered 
5 with plant cells cultured in suspension. The screen disperses the particles so that they are not 
delivered to the recipient cells in large aggregates. It is believed that a screen intervening 
between the projectile apparatus and the cells to be bombarded reduces the size of projectiles 
aggregate and may contribute to a higher frequency of transformation by reducing damage 
inflicted on the recipient cells by projectiles that are too large. 

10 For the bombardment, cells in suspension are preferably concentrated on filters or solid 

culture medium. Alternatively, immature embryos or other target cells may be arranged on 
solid culture medium. The cells to be bombarded are positioned at an appropriate distance 
below the macroprojectile stopping plate. If desired, one or more screens also are positioned 
between the acceleration device and the cells to be bombarded. Through the use of techniques 

15 set forth herein one may obtain up to 1000 or more foci of cells transiently expressing a marker 
gene. The number of cells in a focus which express the exogenous gene product 48 hours post- 
bombardment often range from 1 to 10 and average 1 to 3. 

In bombardment transformation, one may optimize the prebombardment culturing 
conditions and the bombardment parameters to yield the maximum numbers of stable 

20 transformants. Both the physical and biological parameters for bombardment are important in 
this technology. Physical factors are those that involve manipulating the DNA/microprojectile 
precipitate or those that affect the flight and velocity of either the macro- or microprojectiles. 
Biological factors include all steps involved in manipulation of cells before and immediately 
after bombardment, the osmotic adjustment of target cells to help alleviate the trauma 

25 associated with bombardment, and also the nature of the transforming DNA, such as linearized 
DNA or intact supercoiled plasmids. It is believed that pre-bombardment manipulations are 
especially important for successful transformation of immature embryos. 

Accordingly, it is contemplated that one may wish to adjust various of the bombardment 
parameters in small scale studies to fully optimize the conditions. One may particularly wish to 
30 adjust physical parameters such as gap distance, flight distance, tissue distance, and helium 
pressure. One also may minimize the trauma reduction factors (TRFs) by modifying conditions 
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which influence the physiological state ofthe recipient cells and which may therefore influence 
transformation and integration efficiencies. For example, the osmotic state, tissue hydration 
and the subculture stage or cell cycle of the recipient cells may be adjusted for optimum 
transformation. The execution of other routine adjustments will be known to those of skill in 
the art in light of the present disclosure. 

(iii) ^groAac/mi/m-Mediated Transfer 

Agrobacterium-mednXed transfer is a widely applicable system for introducing genes 
mto plant cells because the DNA can be introduced into whole plant tissues, thereby bypassing 
the need for regeneration of an intact plant from a protoplast. The use of Agrobacterium- 
mediated plant integrating vectors to introduce DNA into plant cells is well known in the art. 
See, for example, the methods described (Fraley et al, 1985; Rogers et al, 1987) Using 
conventional transformation vectors, chromosomal integration is required for stable inheritance 
of the foreign DNA. However, the vector described herein may be used for transformation 
with or without integration, as the centromere function required for stable inheritance is 
encoded within the PLAC. In particular embodiments., transformation events in which the 
PLAC is not chromosomally integrated may be preferred, in that problems with site-specific 
variations in expression and insertional mutagenesis may be avoided. 

The integration of the Ti-DNA is a relatively precise process resulting in few 
rearrangements. The region of DNA to be transferred is defined by the border sequences, and 
intervening DNA is usually inserted into the plant genome as described (Spielmann et al 1986- 
Jorgensen et al, 1987). Modem Agrobacterium transformation vectors are capable of 
replication in £ coll as well as Agrobacterium, allowing for convenient manipulations as 
descnbed (Klee et ai, ,985). Moreover, recent technological advances in vectors for 
Agra W/^-mediated gene transfer have improved the arrangement of genes and restriction 
sites in the vectors to facilitate construction of vectors capable of expressing various 
polypeptide coding genes. The vectors described (Rogers et al , 1987), have convenient multi- 
hnker regions flanked by a promoter and a polyadenylation site for direct expression of inserted 
polypeptide coding genes and are suitable for present purposes. In addition, Agrobacterium 
containing both armed and disarmed Ti genes can be used for the transformations. In those 
plant strains where Agra bacterium^*** transformation is efficient, it is the method of 
choice because of the facile and defined nature ofthe gene transfer. 
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Agrobacterium-mediated transformation of leaf disks and other tissues such as 
cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects. 
Agrobacterium-mzdiated transformation is most efficient in dicotyledonous plants. Few 
monocots appear to be natural hosts for Agrobacterium, although transgenic plants have been 
5 produced in asparagus and more significantly in maize using Agrobacterium vectors as 
described (Bytebier et al. y 1987; U.S. Patent No. 5,591,616, specifically incorporated herein by 
reference). Therefore, commercially important cereal grains such as rice, corn, and wheat must 
usually be transformed using alternative methods. However, as mentioned above, the 
transformation of asparagus using Agrobacterium also can be achieved (see, for example, 
10 Bytebier et al, 1987). 

A transgenic plant formed using Agrobacterium transformation methods typically 
contains a single gene on one chromosome. Such transgenic plants can be referred to as being 
hemizygous for the added gene. A more accurate name for such a plant is an independent 
segregant, because each transformed plant represents a unique T-DNA integration event. 

15 More preferred is a transgenic plant that is homozygous for the added foreign DNA; i.e., 

a transgenic plant that contains two copies of a transgene, one gene at the same locus on each 
chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by 
sexually mating (selfing) an independent segregant transgenic plant that contains a single added 
transgene, germinating some of the seed produced and analyzing the resulting plants produced 

20 for enhanced activity relative to a control (native, non-transgenic) or an independent segregant 
transgenic plant. 

Even more preferred is a plant in which the PLAC has not been chromosomally 
integrated. Such a plant may be termed 2n + x, where 2n is the diploid number of 
chromosomes and where x is the number of PLACs. Initially, transformants may be 2n+l, i.e. 
25 having 1 additional PLAC. In this case, it may be desirable to self the plant or to cross the plant 
with another 2n + 1 plant to yield a plant which is 2n + 2. The 2n + 2 plant is preferred in that 
it is expected to pass the PLAC through meiosis to all its offspring. 

It is to be understood that two different transgenic plants also can be mated to produce 
offspring that contain two independently segregating added, exogenous PLACs. Selfing of 
30 appropriate progeny can produce plants that are homozygous for both added, exogenous PLACs 
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that encode a polypeptide of interest. Back-crossing to a parental plant and out-crossing with a 
non-transgenic plant also are contemplated. 

XVI. Other Tra nsformation Methods 

Transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
these treatments (see, e.g., Potrykus et al., 1985; Lorz et al, 1985; Fromm al, 1986; 
Uchimiyae/o/., 1986; Callis etal, 1987; Marcottee/a/., 1988). 

Application of these systems to different plant strains for the purpose of making 
transgenic plants depends upon the ability to regenerate that particular plant strain from 
protoplasts. Illustrative methods for the regeneration of cereals from protoplasts are described 
(Fujimura et al, 1985; Toriyama et al, 1986; Yamada et al., 1986; Abdullah et al., 1986). 

To transform plant strains that cannot be successfully regenerated from protoplasts, 
other ways to introduce DNA into intact cells or tissues can be utilized. For example, 
regeneration of cereals from immature embryos or explants can be effected as described (Vasil 
1988). In addition, "particle gun" or high-velocity microprojectile technology can be utilized 
(Vasil 1992). 

Using that latter technology, DNA is carried through the cell wall and into the 
cytoplasm on the surface of small metal particles as described (Klein et al., 1987; Klein et al., 
1988; McCabe et al., 1988). The metal particles penetrate through several layers of cells and 
thus allow the transformation of cells within tissue explants. 

XVII. Exogenous Genes for Expression in Plan»« 

One particularly important advance of the present invention is that it provides methods 
and compositions for expression of exogenous genes in plant cells. Significantly, the current 
invention allows for the transformation of plant cells with a PLAC comprising a number of 
exogenous genes. Such genes often will be genes that direct the expression of a particular 
protein or polypeptide product, but they also may be non-expressible DNA segments, e.g., 
transposons such as Ds that do not direct their own transposition. As used herein, an' 
"expressible gene" is any gene that is capable of being transcribed into RNA (e.g., mRNA, 
antisense RNA, etc.) or translated into a protein, expressed as a trait of interest, or the like, etc., 
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and is not limited to selectable, screenable or non-selectable marker genes. The inventors also 
contemplate that, where both an expressible gene that is not necessarily a marker gene is 
employed in combination with a marker gene, one may employ the separate genes on either the 
same or different DNA segments for transformation. In the latter case, the different vectors are 
5 delivered concurrently to recipient cells to maximize cotransformation. 

The choice of the particular DNA segments to be delivered to the recipient cells often 
will depend on the purpose of the transformation. One of the major purposes of transformation 
of crop plants is to add some commercially desirable, agronomically important traits to the 
plant. Such traits include, but are not limited to, herbicide resistance or tolerance; insect 

10 resistance or tolerance; disease resistance or tolerance (viral, bacterial, fungal, nematode); stress 
tolerance and/or resistance, as exemplified by resistance or tolerance to drought, heat, chilling, 
freezing, excessive moisture, salt stress; oxidative stress; increased yields; food content and 
makeup; physical appearance; male sterility; drydown; standability; prolificacy; starch quantity 
and quality; oil quantity and quality; protein quality and quantity; amino acid composition; and 

15 the like. One may desire to incorporate one or more genes conferring any such desirable trait or 
traits, such as, for example, a gene or genes encoding herbicide resistance. 

In certain embodiments, the present invention contemplates the transformation of a 
recipient cell with PLACs comprising more than one exogenous gene. As used herein, an 
"exogenous gene," is a gene not normally found in the host genome in an identical context. By 

20 this, it is meant that the gene may be isolated from a different species than that of the host 
genome, or alternatively, isolated from the host genome but operably linked to one or more 
regulatory regions which differ from those found in the unaltered, native gene. Two or more 
exogenous genes also can be supplied in a single transformation event using either distinct 
transgene-encoding vectors, or using a single vector incorporating two or more gene coding 

25 sequences. For example, plasmids bearing the bar and aroA expression units in either 
convergent, divergent, or colinear orientation, are considered to be particularly useful. Further 
preferred combinations are those of an insect resistance gene, such as a Bt gene, along with a 
protease inhibitor gene such as pinll, or the use of bar in combination with either of the above 
genes. Of course, any two or more transgenes of any description, such as those conferring 

30 herbicide, insect, disease (viral, bacterial, fungal, nematode) or drought resistance, male 
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sterility, drydown, stability, prolificacy, starch properties, oil quantity and quality, or those 
increasing yield or nutritional quality may be employed as desired. 

(0 Herbicide Resistance 

The genes encoding phospWnothricin acetyltmnsferase (4 , md pa , x g| „ osa , e 
• •*-« EPSP synthase genes, the g ,yph osaK degradative ^ ^ 

glyphosate oxidoreducfcse, *» (ending a dehalcgerc.se enzyme that inactivates dalapon) 
berbtcide resistant (e.g., s „,f„„y lurea and imidazole) acetoiactate synthase, and fa. genes 
(encoding a mtrilase enzyme tha, degrades bromoxyni,) are good exampies of herbicide 
reststant genes for nse in .^formation. The W and pa, genes code for an enzyme 
Phosphmothricin ^transferase (PAT), which inactivates the herbicide phospWnothricin and 
prevents this compound from inhibiting gjutanrine synthetase enzymes. The enzyme 
5-enoIpynwylshikimate 3-phosphate synthase (EPSP Synthase), is normaliy inhibited by me 
herbicide N.(phosphonomemy,) gl yc i „c (g.yphosare, Hoover, genes a. known that encode 
glyphosate-resistan, EPSP synthase enzyme, These genes are particularly contended for 
use m plan, transformation. The *» gene encodes the enzyme dalapon dehalogenase and 
confers resistance to the herbicide da.apon. The **„ gene codes for a specific nitri.ase enzyme 
that converts bromoxyni! ,o a non-herbicidal degradation product. 

(ii) Insect Resistance 

Potemia. insect resistance genes that can be introduced inchrde BaciUus , hu nn g ie mi s 
crystal toxin genes or Bt genes (Watntd a, a,., ,985). Bt genes may provide reduce ,„ 
leptdopteran or coleopteran pests such as European C om Borer (ECB). Preferred Bt toxin 
genes for use in such embodiment inCude the O*, ft, and CryU ( c) genes. Endotoxin genes 
fom other species of* which ^ inxa ^ „ ^ 

employed in this regard. 

.t is contemplated tha, preferred B, genes for use in the transformauon protocols 
dtsdosed herein will be those i„ which the coding sequence has been modified to effect 
tncreased exprassion in ptats, arc, more particular*, in monoco, p,ants. Means for preparing 
« genes a. well known in the an and are disclosed in, for exampte, U.S. Patent No. 
5,500365 and U.S. Patent Number No. 5,689,052, each of the disclosures of which are 
specifically Incorporated herein by reference in their entitety. Examples of such modified Bt 
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toxin genes include a synthetic Bt CryIA(b) gene (Perlak et ai, 1991), and the synthetic 
CrylA(c) gene termed 1800b (PCT Application WO 95/06128). Some examples of other Bt 
toxin genes known to those of skill in the art are given in Table 1 below. 

Table 1 



Bacillus thuringiensis 5-Endotoxin Genes' 



New Nomenclature 


Old Nomenclature 


GenBank Accession 


CrylAa 


CrylA(a) 


Ml 1250 


CrylAb 


CrylA(b) 


M13898 


Cry 1 Ac 


CryIA(c) 


Ml 1068 


Cry 1 Ad 


CrylA(d) 


M73250 


CrylAe 


CrylA(e) 


M65252 


CrylBa 


CrylB 


X0671 1 


CrylBb 


ET5 


L32020 


Cry 1 Be 


PEG5 


Z46442 


CrylBd 


CryEl 


U70726 


CrylCa 


CrylC 


X07518 


CrylCb 


CrylC(b) 


M97880 


Cry 1 Da 


CrylD 


X54160 


CrylDb 


PrtB 


Z2251 1 


CrylEa 


CrylE 


X53985 


CrylEb 


CryIE(b) 


M73253 


Cry 1 Fa 


CrylF 


M63897 


CrylFb 


PrtD 


Z22512 


CrylGa 


PrtA 


Z22510 


Cry 1Gb 


CryH2 


U70725 


Cry 1 Ha 


PrtC 


Z22513 


CrylHb 




U35780 


Crylla 


CryV 


X62821 


Cryllb 


CryV 


U07642 


CrylJa 


ET4 


L32019 
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New Nomenclature 



Table 1 - Continued 
Old Nomenclature 



GenBank Accession 



CrylJb 

CrylK 

Cry2Aa 

Cry2Ab 

Cry2Ac 

Cry3A 

Cry3Ba 

Cry3Bb 

Cry3C 

Cry4A 

Cry4B 

Cry5Aa 

Cry5Ab 

Cry6A 

Cry6B 

Cry7Aa 

Cry7Ab 

Cry8A 

Cry8B 

Cry8C 

Cry9A 

Cry9B 

Cry9C 

Cry 1 OA 

CryllA 

CryllB 
Cryl2A 
Cryl3A 



ET1 

CryllA 

CryllB 

CryllC 

CrylllA 

CrylllB 

CryIIIB2 

CrylllD 

CrylVA 

CrylVB 

CryVA(a) 

CryVA(b) 

CryVIA 

CryVIB 

CryHIC 

CrylllCb 

CrylllE 

CrylllG 

CrylllF 

CrylG 

CrylX 

CrylH 

CrylVC 

CrylVD 

Jeg80 

CryVB 

CryVC 



U31527 

U28801 

M31738 

M23724 

X57252 

M22472 

X17123 

M89794 

X59797 

Y00423 

X07423 

L07025 

L07026 

L07022 

L07024 

M64478 

U04367 

U04364 

U04365 

U04366 

X58120 

X75019 

Z37527 

M12662 

M31737 

X86902 

L07027 

L07023 
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Table 1 - Continued 



New Nomenclature 


Old Nomenclature 


GenBank Accession 


Cryl4A 


CryVD 


U13955 


Cryl5A 


34kDa 


M76442 


Cryl6A 


cbm71 


X94146 


Cryl7A 


cbm71 


X99478 


Cryl8A 


CryBPl 


X99049 


Cryl9A 


Jeg65 


Y08920 


CytlAa 


CytA 


X03182 


CytlAb 


CytM 


X98793 


Cyt2A 


CytB 


Z14147 


Cyt2B 


CytB 


U52043 



a Adapted from: http://epunix.biols.susx.ac.uk/Home/Neil_Crickinore/Bt/index.htm 



Protease inhibitors also may provide insect resistance (Johnson et ai 9 1989), and will 
thus have utility in plant transformation. The use of a protease inhibitor II gene, pinll, from 
5 tomato or potato is envisioned to be particularly useful. Even more advantageous is the use of a 
pinll gene in combination with a Bt toxin gene, the combined effect of which has been 
discovered to produce synergistic insecticidal activity. Other genes which encode inhibitors of 
the insect's digestive system, or those that encode enzymes or co- factors that facilitate the 
production of inhibitors, also may be useful. This group may be exemplified by oryzacystatin 
10 and amylase inhibitors such as those from wheat and barley. 

Also, genes encoding lectins may confer additional or alternative insecticide properties. 
Lectins (originally termed phytohemagglutinins) are multivalent carbohydrate-binding proteins 
which have the ability to agglutinate red blood cells from a range of species. Lectins have been 
identified recently as insecticidal agents with activity against weevils, ECB and rootworm 
15 (Murdock et aL 9 1990; Czapla & Lang, 1990). Lectin genes contemplated to be useful include, 
for example, barley and wheat germ agglutinin (WGA) and rice lectins (Gatehouse et al y 1984), 
with WGA being preferred. 
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Genes conning the production of la^e or small p„| ypeptides active againsl insccB 
when intioduced into ,he inseo, pest, such as, e.g., ,ytic peptides , ^ h _ ^ ^ 
and venoms, fonn anomer aspee, of me invention. For examp,e, jt „ ^ ^ ^ 
expression of juvenile hormone esterase, directed towards specific insect pests, also may result 
» m msecticida, activity, or perhaps cause cessation of metamorphosis (Hammock e, a,., ,990). 

Transgenic piams expressing genes which encode enzymes tha, affect the integrity of 
the msec cuticle form ye. another aspect of the invention. Such genes include those encoding 
e g ., chatinaae, proteases, lipases and also genes for the production of neomycin, a compound 
mat ,„h,bits chitin synthesis, me introduction of any of which is contemplated to produce insect 
restsran, ptos. Genes that code for activities tha, affect insect molting> such „ ^ 
the production of ecoysteroid UDP-glucosyl transferase, also fal, whhin the scope of the useful 
transgenes of the present invention. 

Genes tha, code for enzymes ma, facilhate the production of compounds mat reduce the 
nutntiona, q uali.y of the hos, plan, ,o msec, posts also are encompassed by me presen, 
mvention. ,, may be possible, for i„s,ance, ,o confer insecicidal activity on a plan, by altering 
■* aero, composition. S,ero,s are obfcined by insect from ,heir die, and are used for hormone 
syn.hes.s and membrane ability. TWore Orations in plan, s,ero. composition by 
expansion of novel genes, ,g„ those ma, duectiy p rom „ te ,he production of u„desirab,e s,ero,s 
or mose ma, convert desirable sterols i„,o undesirable forms, couid have a negative effec, on 
■nsee, grow* and/or developmen, and hence endow me pte with msecticida, activhy 
lipoxygenases a. naturahy occurring p,an, enzymes ma, have boen shown ,o exhibit an,i- 
nutntiona, effect on inseo. and ,„ reduce me nmritional q uali,y of Urcir die,. Therefore 
htnher embodiment of tite invention concern transgenic plant with enhanced lipoxygenase 
activity which may be reaisran, to insect feeding. * "' 

Trip.acun, *,c<y,oUes is a species of grass ma, is resistan, ,o certain insecB , deluding 
con, roo, worm. „ is anticipated ma, genes encoding proteins ma, are tiaxic ,„ insect „, are 
mvoived in me biosynthesis of compounds toxic t0 ^ wju h . ^ ^ - 
ft* mese novel genes will be usefu, in herring resistence ,o insect. ,, is ^ te me 
baas of msec, resist ance in Wpsaam is genetic, because said resist has been nansfeneti 
,o Zea m ays via sexual crosses (Branson and Guss, ,972). 1, i s mrther anticipated «ha, outer 
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cereal, monocot or dicot plant species may have genes encoding proteins that are toxic to 
insects which would be useful for producing insect resistant plants. 

Further genes encoding proteins characterized as having potential insecticidal activity 
also may be used as transgenes in accordance herewith. Such genes include, for example, the 
5 cowpea trypsin inhibitor (CpTI; Hilder et ai, 1987) which may be used as a rootworm 
deterrent; genes encoding avermectin (Avermectin and Abamectin., Campbell, W.C., Ed., 1989; 
Ikeda et al, 1987) which may prove particularly useful as a corn rootworm deterrent; ribosome 
inactivating protein genes; and even genes that regulate plant structures. Transgenic plants 
including anti-insect antibody genes and genes that code for enzymes that can convert a non- 
10 toxic insecticide (pro-insecticide) applied to the outside of the plant into an insecticide inside 
the plant also are contemplated. 

(iii) Environment or Stress Resistance 

Improvement of a plants ability to tolerate various environmental stresses such as, but 
not limited to, drought, excess moisture, chilling, freezing, high temperature, salt, and oxidative 

15 stress, also can be effected through expression of novel genes. It is proposed that benefits may 
be realized in terms of increased resistance to freezing temperatures through the introduction of 
an "antifreeze" protein such as that of the Winter Flounder (Cutler et aL, 1989) or synthetic 
gene derivatives thereof. Improved chilling tolerance also may be conferred through increased 
expression of glycerol-3-phosphate acetyltransferase in chloroplasts (Wolter et aL, 1992). 

20 Resistance to oxidative stress (often exacerbated by conditions such as chilling temperatures in 
combination with high light intensities) can be conferred by expression of superoxide dismutase 
(Gupta et al„ 1993), and may be improved by glutathione reductase (Bowler et al y 1992). Such 
strategies may allow for tolerance to freezing in newly emerged fields as well as extending later 
maturity higher yielding varieties to earlier relative maturity zones. 

25 It is contemplated that the expression of novel genes that favorably effect plant water 

content, total water potential, osmotic potential, and turgor will enhance the ability of the plant 
to tolerate drought. As used herein, the terms "drought resistance" and "drought tolerance" are 
used to refer to a plants increased resistance or tolerance to stress induced by a reduction in 
water availability, as compared to normal circumstances, and the ability of the plant to function 

30 and survive in lower- water environments. In this aspect of the invention it is proposed, for 
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that the egression of genes encoding for fc biMynthesjs 
sota.es, such as po, y „, con, P ou„da, may tapan J J 

- : p encodtag f„, ^,0,^^ del)ydrog _ ; ^ 

b-h of wtach have heen we,, documented . protectjve compomis ^ P ^ 

of sues, Mannito, accumutation m « tobacco ^ ^ J j££ 

res* mdtca* tha, piants expressing high ,eve,s of this ^ are able ,„ 'J^Z 
10 Wlted osmotic stress <Tarezynskie, a /., ,992, ,993). '» tolerate an 

SimitaHy, the efficacy of other metahontes i„ protect^ either enzyme mnction <„ 
.anopme or P ro P io„ic acid) „ r mOTbrane ^ (e g ^ 
ootms « „, ,939), a„ d ^ rf ^ fc 

compounds mtgh, confer drought resist in a manner simitar to or compliments^ 
■5 mannttof Odter exam P ,es of naturany occtmrng metres that are osmotica,,, ' i^ 
^ some direct pro tective effect ^ ^ ^ ^ 

i t B " 92) ~ B Leopo.d, 
.992), ratW (BemaNLugo and Leopold, ,9,2), prollne (Rensburg „ „ 

*> hetame, ononito, and pinito, (Vemon and Bohnert, ,992). Conttaue canopy 

~ reproduce rW durtng times of stress wi„ he augmented hy ul^ 2 
«„ of genes such as Aox COMro| , ing ^ 

above and oUter such compounds. Current* prefemd ge „ B which £ ^ 
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parts {i.e. Mundy and Chua, 1988; Piatkowski et al, 1990; Yamaguchi-Shinozaki et aL, 1992). 
Recently, expression of a Type-Ill LEA (HVA-1) in tobacco was found to influence plant 
height, maturity and drought tolerance (Fitzpatrick, 1993). In rice, expression of the HVA-1 
gene influenced tolerance to water deficit and salinity (Xu et aL, 1996). Expression of 
5 structural genes from all three LEA groups may therefore confer drought tolerance. Other types 
of proteins induced during water stress include thiol proteases, aldolases and transmembrane 
transporters (Guerrero et aL, 1990), which may confer various protective and/or repair-type 
functions during drought stress. It also is contemplated that genes that effect lipid biosynthesis 
and hence membrane composition might also be useful in conferring drought resistance on the 
10 plant. 

Many of these genes for improving drought resistance have complementary modes of 
action. Thus, it is envisaged that combinations of these genes might have additive and/or 
synergistic effects in improving drought resistance in plants. Many of these genes also improve 
freezing tolerance (or resistance); the physical stresses incurred during freezing and drought are 

15 similar in nature and may be mitigated in similar fashion. Benefit may be conferred via 
constitutive expression of these genes, but the preferred means of expressing these novel genes 
may be through the use of a turgor-induced promoter (such as the promoters for the turgor- 
induced genes described in Guerrero et aL, 1990 and Shagan et aL, 1993 which are 
incorporated herein by reference). Spatial and temporal expression patterns of these genes may 

20 enable plants to better withstand stress. 

It is proposed that expression of genes that are involved with specific morphological 
traits that allow for increased water extractions from drying soil would be of benefit. For 
example, introduction and expression of genes that alter root characteristics may enhance water 
uptake. It also is contemplated that expression of genes that enhance reproductive fitness 
25 during times of stress would be of significant value. For example, expression of genes that 
improve the synchrony of pollen shed and receptiveness of the female flower parts, Le, 9 silks, 
would be of benefit. In addition it is proposed that expression of genes that minimize kernel 
abortion during times of stress would increase the amount of grain to be harvested and hence be 
of value. 

30 Given the overall role of water in determining yield, it is contemplated that enabling 

plants to utilize water more efficiently, through the introduction and expression of novel genes, 
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Included amongst the PR proteins are (M, 3-glucanases, chitinases, and osmotin and other 
proteins that are believed to function in plant resistance to disease organisms. Other genes have 
been identified that have antifungal properties, e.g., UDA (stinging nettle lectin) and hevein 
(Broakaert et aL, 1989; Barkai-Golan et al. y 1978). It is known that certain plant diseases are 
5 caused by the production of phytotoxins. It is proposed that resistance to these diseases would 
be achieved through expression of a novel gene that encodes an enzyme capable of degrading or 
otherwise inactivating the phytotoxin. It also is contemplated that expression of novel genes 
that alter the interactions between the host plant and pathogen may be useful in reducing the 
ability of the disease organism to invade the tissues of the host plant, e.g., an increase in the 
10 waxiness of the leaf cuticle or other morphological characteristics. 

(v) Plant Agronomic Characteristics 

Two of the factors determining where crop plants can be grown are the average daily 
temperature during the growing season and the length of time between frosts. Within the areas 
where it is possible to grow a particular crop, there are varying limitations on the maximal time 

15 it is allowed to grow to maturity and be harvested. For example, a variety to be grown in a 
particular area is selected for its ability to mature and dry down to harvestable moisture content 
within the required period of time with maximum possible yield. Therefore, crops of varying 
maturities is developed for different growing locations. Apart from the need to dry down 
sufficiently to permit harvest, it is desirable to have maximal drying take place in the field to 

20 minimize the amount of energy required for additional drying post-harvest. Also, the more 
readily a product such as grain can dry down, the more time there is available for growth and 
kernel fill. It is considered that genes that influence maturity and/or dry down can be identified 
and introduced into plant lines using transformation techniques to create new varieties adapted 
to different growing locations or the same growing location, but having improved yield to 

25 moisture ratio at harvest. Expression of genes that are involved in regulation of plant 
development may be especially useful. 

It is contemplated that genes may be introduced into plants that would improve 
standability and other plant growth characteristics. Expression of novel genes in plants which 
confer stronger stalks, improved root systems, or prevent or reduce ear droppage would be of 
30 great value to the farmer. It is proposed that introduction and expression of genes that increase 
the total amount of photoassimilate available by, for example, increasing light distribution 
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A number of mutations were discovered in maize that confer cytoplasmic male sterility. 
One mutation in particular, referred to as T cytoplasm, also correlates with sensitivity to 
Southern corn leaf blight. A DNA sequence, designated TURF- 13 (Levings, 1990), was 
identified that correlates with T cytoplasm. It is proposed that it would be possible through the 
5 introduction of TURF- 13 via transformation, to separate male sterility from disease sensitivity. 
As it is necessary to be able to restore male fertility for breeding purposes and for grain 
production, it is proposed that genes encoding restoration of male fertility also may be 
introduced. 

(viii) Negative Selectable Markers 

10 Introduction of genes encoding traits that can be selected against may be useful for 

eliminating undesirable linked genes. It is contemplated that when two or more genes are 
introduced together by cotransformation that the genes will be linked together on the host 
chromosome. For example, a gene encoding a Bt gene that confers insect resistance on the 
plant may be introduced into a plant together with a bar gene that is useful as a selectable • 

1 5 marker and confers resistance to the herbicide Liberty® on the plant. However, it may not be 
desirable to have an insect resistant plant that also is resistant to the herbicide Liberty®. It is 
proposed that one also could introduce an antisense bar gene that is expressed in those tissues 
where one does not want expression of the bar gene, e.g., in whole plant parts. Hence, although 
the bar gene is expressed and is useful as a selectable marker, it is not useful to confer herbicide 

20 resistance on the whole plant. The bar antisense gene is a negative selectable marker. 

It also is contemplated that negative selection is necessary in order to screen a 
population of transformants for rare homologous recombinants generated through gene 
targeting. For example, a homologous recombinant may be identified through the inactivation 
of a gene that was previously expressed in that cell. The antisense gene to neomycin 

25 phosphotransferase II (NPT II) has been investigated as a negative selectable marker in tobacco 
(Nicotiana tabacum) and Arabidopsis thaliana (Xiang. and Guerra, 1993). In this example, 
both sense and antisense NPT II genes are introduced into a plant through transformation and 
the resultant plants are sensitive to the antibiotic kanamycin. An introduced gene that integrates 
into the host cell chromosome at the site of the antisense NPT II gene, and inactivates the 

30 antisense gene, will make the plant resistant to kanamycin and other aminoglycoside 
antibiotics. Therefore, rare, site-specific recombinants may be identified by screening for 
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negative selectable marker may make it possible to eliminate the autonomous element during 
the breeding process. 

(ix) Non-Protein-Expressing Sequences 

DNA may be introduced into plants for the purpose of expressing RNA transcripts that 
5 function to affect plant phenotype yet are not translated into protein. Two examples are 
antisense RNA and RNA with ribozyme activity. Both may serve possible functions in 
reducing or eliminating expression of native or introduced plant genes. However, as detailed 
below, DNA need not be expressed to effect the phenotype of a plant. 

1 . Antisense RNA 

1 0 Genes may be constructed or isolated, which when transcribed, produce antisense RNA 

that is complementary to all or part(s) of a targeted messenger RNA(s). The antisense RNA 
reduces production of the polypeptide product of the messenger RNA. The polypeptide product 
may be any protein encoded by the plant genome. The aforementioned genes will be referred to 
as antisense genes. An antisense gene may thus be introduced into a plant by transformation 

1 5 methods to produce a novel transgenic plant with reduced expression of a selected protein of 
interest. For example, the protein may be an enzyme that catalyzes a reaction in the plant. 
Reduction of the enzyme activity may reduce or eliminate products of the reaction which 
include any enzymatically synthesized compound in the plant such as fatty acids, amino acids, 
carbohydrates, nucleic acids and the like. Alternatively, the protein may be a storage protein, 

20 such as a zein, or a structural protein, the decreased expression of which may lead to changes in 
seed amino acid composition or plant morphological changes respectively. The possibilities 
cited above are provided only by way of example and do not represent the full range of 
applications. 

2. Ribozymes 

25 Genes also may be constructed or isolated, which when transcribed, produce RNA 

enzymes (ribozymes) which can act as endoribonucleases and catalyze the cleavage of RNA 
molecules with selected sequences. The cleavage of selected messenger RNAs can result in the 
reduced production of their encoded polypeptide products. These genes may be used to prepare 
novel transgenic plants which possess them. The transgenic plants may possess reduced levels 

30 of polypeptides including, but not limited to, the polypeptides cited above. 
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Ribozyn.es are RNA-protein complexes that cleave nucleic acids in a site-specific 
fasmon. Isozymes have specific catalytic domains that possess endonuclease activity (Kim 
and Cech, 1987; Gerlach 1987; Forster and Symons, 1987). For example, a large number 
of nbozymes accelerate phosphoester transfer reactions with a high degree of specificity, often 
cleavmg only one of several phosphoesters in an oligonucleotide substrate (Cech et al 1981- 
Miche, and Westhof, ,990; Reinhold-Hurek and Shub, 1992). This specificity has been 
attnbuted to the requirement that the substrate bind via specific base-pairing interactions to the 
internal guide sequence (»IGS«) of the ribozyme prior to chemical reaction. 

Ribozyme catalysis has primarily been observed as part of sequence-specific 
cleavage/ligation reactions involving nucleic acids (Joyce, 1989; Cech etal, ,981) For 
example, U. S. Patent 5,354,855 reports that certain nbozymes can act as endonucleases with a 
sequence specificity greater than that of known ribonucleases and approaching that of the DNA 
restriction enzymes. 

Several different ribozyme motifs ha ve ^ ^ ^ d ^ 

(Symons, ,992). Examples inelude sequences from me Group ! self splicing introns including 
Tobacco Ringspo. Virus (Prcdy ,986). Avocado Sunblotch Viroid (Palukaitis e, a, 

■979; Symons, ,981), and Lucerne Transient Stieak Virus (Forster and Symons 1987)' 
Sequences from these and related viruses ate tefetred «„ as hammerhead ribozyme based on a 
predicted folded secondary structure. 

Outer suitable ribozymes include sequences from RNase P with RNA cleavage activity 
(Yuan „./., ,992, Yuan and Altmat, ,994. U. S. Patents 5,,68,053 am, 5,624,824), haitpin 
rtbozyme stntctures (BerzaLHenanz e.ai, ,992; Chowrita e,ai, ,993) and Hepatitis Delta 
vtrua based ribozymes (U, S, Patent 5,625,047). rhe genera, design and optimization of 
rtbozyme ditected RNA cleavage activity has been discussed in detail (Hase.off and Gerlach 
,988, Symons, ,992, Chowrira « al., ,994; Thompson « al., 1995). 

The other variable on ribozyme design is the seleotion of a cleavage site on a given 
target RNA. Ribozymes are targeted «o a given sequence by vinrte of annealing ,o a site by 
enmphmentary base pair interaction. Two stretches of homology are required for mis 
tergeting. These snetehes of homo,ogous sequences flank the catalytic ribozyme stntctiue 
defined above. Each stretch of homologous sequence can vary in iength from 7 to ,5 
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nucleotides. The only requirement for defining the homologous sequences is that, on the target 
RNA, they are separated by a specific sequence which is the cleavage site. For hammerhead 
ribozyme, the cleavage site is a dinucleotide sequence on the target RNA is a uracil (U) 
followed by either an adenine, cytosine or uracil (A,C or U) (Perriman et aL 9 1992; Thompson 
5 etal, 1995). The frequency of this dinucleotide occurring in any given RNA is statistically 3 
out of 16. Therefore, for a given target messenger RNA of 1000 bases, 187 dinucleotide 
cleavage sites are statistically possible. 

Designing and testing ribozymes for efficient cleavage of a target RNA is a process well 
known to those skilled in the art. Examples of scientific methods for designing and testing 
10 ribozymes are described by Chowrira etal, (1994) and Lieber and Strauss (1995), each 
incorporated by reference. The identification of operative and preferred sequences for use in 
down regulating a given gene is simply a matter of preparing and testing a given sequence, and 
is a routinely practiced "screening" method known to those of skill in the art. 

3. Induction of Gene Silencing 

15 It also is possible that genes may be introduced to produce novel transgenic plants 

which have reduced expression of a native gene product by the mechanism of co-suppression. 
It has been demonstrated in tobacco, tomato, and petunia (Goring et al 9 1991; Smith et al. 9 
1990; Napoli et al, 1990; van der Krol et al. 9 1990) that expression of the sense transcript of a 
native gene will reduce or eliminate expression of the native gene in a manner similar to that 

20 observed for antisense genes. The introduced gene may encode all or part of the targeted native 
protein but its translation may not be required for reduction of levels of that native protein. 

4. Non-RNA-Expressing Sequences 

DNA elements including those of transposable elements such as Ds, Ac, or Mu, may be 
inserted into a gene to cause mutations. These DNA elements may be inserted in order to 

25 inactivate (or activate) a gene and thereby "tag" a particular trait. In this instance the 
transposable element does not cause instability of the tagged mutation, because the utility of the 
element does not depend on its ability to move in the genome. Once a desired trait is tagged, 
the introduced DNA sequence may be used to clone the corresponding gene, e.g., using the 
introduced DNA sequence as a PCR primer together with PCR gene cloning techniques 

30 (Shapiro, 1983; Dellaporta et al. 9 1988). Once identified, the entire gene(s) for the particular 
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trait, mcluding control or regulatory regions where desired, may be isolated, cloned and 
mandated as desired. The utility of DNA elements introduced into an organism for purposes 
of gene tagging is independent of the DNA sequence and does not depend on any biological 
actmty of the DNA sequence, i.e., transcription into RNA or translation into protein. The sole 
function of the DNA element is to disrupt the DNA sequence of a gene. 

It is contemplated that unexpressed DNA sequences, including novel synthetic 
sequences, could be introduced into cells as proprietary '•labels" of those cells and plants and 
seeds thereof. It would not be necessary for a label DNA element to disrupt the function of a 
gene endogenous to the host organism, as the sole function of this DNA would be to identify 
the ongin of the organism. For example, one could introduce a unique DNA sequence into a 
plant and this DNA element would identify all cells, plants, and progeny of these cells as 
havmg arisen from that labeled source. It is proposed that inclusion of label DNAs would 
enable one to distinguish proprietary germplasm or germplasm derived from such, from 
unlabelled germplasm. 

Another possible element which may be introduced is a matrix attachment region 
element (MAR), snch as the chicken .ysozyme A element (Stief, 1989), which can be 
postnoned aronnd an expressible gene of interest ,o effect an increase in overal! exptession of 
the gene and diminish position dependent effects upon incotporation into the plan, genome 
(Stief el al, 1989; Phi-Van el at., 1990). 

20 XVIII. Example 

Tne following examples are included to demonstrate preferred embodiments of the 
tnvenuon. „ should be appreciated by those of skilled the art una. me techniques disclosed in 
me examples which follow represent techniques discovered by me invemora ,o ftmcion well in 
•he practice of me invention, attd thus can be considered «o constitute prefetred modes for its 
practice. However, mose of skill in .he art should, in Ugh. of the presen, disclosure, appreciate 
ma. many changes can be made in me specific embodiments w hi ch are disclosed and still 
obtam a like or simitar ream, without iepmiag fom ^ conc<w spirj , ^ ^ rf ^ 
mvention. More specifically, it will be apparent ma. certain agen,s which are bom chemically 
and physiologically related may be substituted for the agents described herein whiie me same or 
suralar results would be achieved. All such similar substittttes arm modifications apparent ,„ 
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those skilled in the art are deemed to be within the spirit, scope and concept of the invention as 
defined by the appended claims. 

EXAMPLE 1 

Generation of an Arabidopsis thaliana Mapping Population 

5 To generate a pollen donor plant, two alleles of qrtl were crossed to one another. The 

qrtl-1 allele was in the Landsberg ecotype background and the qrtl-2 allele was in the Columbia 
ecotype background. The Landsberg ecotype was readily discernible from the Columbia 
ecotype because it carries a recessive mutation, erecta, which causes the stems to thicken, 
infloresences to be more compact, and the leaves to be more rounded and small than wildtype. 

10 To utilize this as a marker of a donor plant, qrtl-2 pollen was crossed onto a qrtl-1 female 
stigma. The F, progeny were heterozygous at all molecular markers yet the progeny retain the 
quartet phenotype of a tetrad of fused pollen grains. In addition, progeny display the ERECTA 
phenotype of the Columbia plant. This visible marker serves as an indication that the crossing 
was successful in generating plants segregating ecotype specific markers. Further testing was 

1 5 done to the donor plants by performing PCR analysis to insure that progeny were heterozygous 
at molecular loci. 

Due to the fact that the pollen grains can not be directly assayed for marker segregation 
and because of the desire to create a long-term resource available for multiple marker assays, it 
was necessary to cross individual tetrads generated by the donor plant. This created sets of 

20 progeny plants which yielded both large quantities of tissue and seed. These crosses were 
accomplished efficiently by generating a recipient plant homozygous for glaborous 1-1 and 
male sterility. The recessive visible mutant, glaborousl-1, was chosen to guard against the 
possibility of the recipient plant self-fertilizing and the progeny being mistaken for tetrad 
plants. Pollen inviability through male sterile-1 was introduced to guard against the recipient 

25 plant self-fertilizing. Due to the fact that the homozygous plant does not self, a stock seed 
generated by a heterozygous male sterility 1 plant needs to be maintained from which sterile 
recipient plants can be selected. A large resource of seed segregating the optimal sterile and 
visually marked recipient plant was obtained. 
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EXAMPLE 2 
Tetrad Pollinations 

Tetrad pollination, were carried out as follows. A matnre flower was removed from the 
donor pta and tapped upon a glass mictoscope slide ,„ release mature tetrad poUen grains 
5 Thts slide was men planed under a 20-40* Zeiss dissenting microscope. To isolate individual 
tetrad pollen grains, a smai. wooden down! was used to which an eyebrow hair with rubber 
cement was mourned. Using the light microscope, a tetrad pollen unit was chosen and touched 
to the eyebrow hair. The tetrad preferentially adhered ,„ the eyebrow hair and was thus lifted 
from the microscope slide and transported the recipient plan, stigmatic surface. The transfer 
10 was carried out without the use of me microscope, and the eyebrow hair with adhering tetrad 
was then placed against the recipient stigmatic surface and the hair was manually dragged 
across the stigma surface. The tetrad men preferentially adhered to me stigma of the recipient 
and the cross pollination was completed. 

Preferably, an additional backcross of the gnl-2 parent was used ,„ increase the 
15 polhnat.on efficiency, which was increased ,„ >80% successful seed se, production. Initially 
57 tetrad seed sets consisting of 3-4 seeds each, were collected. Plants were grow „ from ^ 
.etrad seed sets, and tissue was collected. DNA was extracted fern a small portion of the 
stored tissue for PGR based segregation ana.ysis. Additional the segregation of the visib ,e 
ereaa phenotypo was scored. When the plants se, seed, the seed was collected as a source for 
20 the larger amounts of DNA required to analyze RFLP segregation by Southern blotting. 

EXAMPLE 3 

Genetic Mapping of Centromeres 

To map centromeres, F, plants which were heterozygous for hundreds of polymorphic 
DNA markers were generated by ctossing qmlel mutants from ^ ^ 
25 ecotypes (Chang „ a , 1988; ^ 1994; ^ ^ ^ ^ ^ ^ ^ 

plants, genetic markers segregate in a 2:2 ratio (FIG. 6; Preuss e, at. 1994). The segregation of 
mtukers was then determined in p,a„ K which were gM by ^ ^ ^ ^ ^ 
F, plants onto a Landsberg homozygofe. The genotype of me poUen grains within a tetrad was 
mferred from the genotype of the progeny, .nitially, seeds were generated from greater man 
100 succnssfitl tetrad pollinations, and tissue and seeds were col.ec.ed from 57 of these This 
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provided sufficient material for PCR, as well as seeds necessary for producing the large 
quantities of tissue required for Southern hybridization and RFLP mapping. In order to obtain a 
more precise localization of the centromeres the original tetrad population was increased from 
57 tetrads to over 388 tetrads. Additional tetrads may be collected to provide even better 
5 resolution. 

PCR analysis was performed to determine marker segregation. To account for the 
contribution of the Landsberg background from the female parent, one Landsberg complement 
from each of the four tetrad plants was subtracted. As shown in FIG. 5, markers from sites 
spanning the entire genome were used for pair-wise comparisons of all other markers. 
10 Tetratypes indicate a crossover between one or both markers and their centromeres where as 
ditypes indicate the absence of crossovers (or presence of a double crossover). 

Thus, at every genetic locus, the resulting diploid progeny was either L/C or C/C. The 
map generated with these plants is based solely on male meioses, unlike the existing map, 
which represents an average of recombination's in both males and females. Therefore, several 
15 well-established genetic distances were recalculated and thus will determine whether 
recombination frequencies are significantly altered. 

The large quantities of genetic data generated by the analysis must be compared pair- 
wise to perform tetrad analysis. AH of the data was managed in a Microsoft Excel spread-sheet 
format, assigning Landsberg alleles a value of "1" and Columbia alleles a value of "0". Within 

20 a tetrad, the segregation of markers on one chromosome was compared to centromere-linked 
reference loci on a different chromosome (see Table 2 below). Multiplying the values of each 
locus by an appropriate reference, and adding the results for each tetrad easily distinguished 
PD, NPD, and TT tetrads with values of 2, 0, and 1, respectively. By scoring more than 53 
PCR-based genetic markers distributed across the genome, all five Arabidopsis thaliana 

25 centromeres were mapped to small intervals (FIG. 3A-3E). Additionally, for each centromeric 
interval, a number of useful recombinants were identified. 
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Table 2 

Scoring protocol for tetratypes. 

Individual 

members Reference 
of a tetrad Locus 1 Locus Locus 2 



Reference 



Locus 



A 


1 


x 1= l 


0 


x 1 = 0 


0 


x 1 = 0 


B 


1 


x 1= l 


0 


x 1 = 0 


1 


x 1 = l 


C 


0 


x 0= 0 


1 


x 0= 0 


0 


x 0= 0 


D 


0 


x 0= 0 


1 


x 0= 0 


1 


x 0= 0 






2 




0 




1 






PD 




NPD 




TT 



EXAMPLE 4 
Mapping Results: Arabidopsis Chromosomes 1-5 

The centromere on chromosome 1 was mapped between mi342 (56.7 cM) and T27K12 
(59.. cM). A more refined position places the centromere between the marker T22C23 (-58 5 

cM) and T27K12 (59.1 cM). However since T2?r?l i* • , 

) ever, since 1 22C23 is not precisely mapped the centromere 

may be between the publicly available markers mi342 and T27K12. 

The centromere on chromosome 2 was mapped between mi310 (18.6 cM) and g4133 
(23.8 cM). Within this interval the centromere may be between the following publicly 
available marker pairs: mi3,0 and mi421, mi421 and g 4532, g 4532 and SEP2A, and SEP2A 
andg4133. 

The centtomere on chromes 3 was mapped between atpox (48.6 cM) and ve02 1 
(54.7 cM). A more refined position p,aces the centromere between the marker atpox (48 6 cM) 
and ! 9,F,T7 (-54.2 cM) however, since 9,F,T7 is no, precise* mapped, the centromere may 
be between the Mowing pnbiiCy availab.e marker pairs: atpox and zun2, zim2 and mi79b 
mi79b and RCEN3, RCEN3 and ASD, ASD and a-1, a-1 and ,04. 09, and ,04, 09 and ve021. ' 

Tie centiomere on chromosome 4 was mapped between mi233 (18.8 cM) and mi!67 
(21 .5 cM). A mo re refined position places the cenfrome* between the marker, mi233 a,d 
F.3HI4 however since F.3H.4 is no, precise* mapped, me cannons may be between fhe 
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following publicly available marker pairs: mi233 and g2616, g2616 and mi306, rni306 and 
m506, m506 and ngal2, ngal2 and BIO200, BIO200 and mi87, mi87 and CI lAth, CI lAth and 
m456A, and m456A and mi 167. 

The centromere on chromosome 5 was mapped between nga76 (71.6 cM) and PhC 
5 (74.3 cM). Within this interval the centromere may be between the following publicly 
available marker pairs: nga76 and mi291b, mi291b and CMsl, and CMsl and PhyC. 

All of the above markers and genetic positions {i.e. cM values) correspond to the Lister 
and Dean Recombinant Inbred Genetic map, available on-line at: http://genome- 
www3.stanford.edu/atdb_welcome. html. 

10 EXAMPLES 

Construction of Artificial Plant Chromosomes 

Plant artificial chromosomes are constructed by combining the previously isolated 
essential chromosomal elements. Exemplary artificial chromosomes include those designed to 
be "shuttle vectors"; Le., they can be maintained in a convenient host (such as E. coli, 
1 5 Agrobacterium or yeast) as well as plant cells. 

An artificial chromosome can be maintained in E. coli or other bacterial cells as a 
circular molecule by placing a removable stuffer fragment between the telomeric sequence 
blocks. The stuffer fragment is a dispensable DNA sequence, bordered by unique restriction 
sites, which can be removed by restriction digestion of the circular DNAs to create linear 

20 molecules with telomeric ends. The linear PLAC can then be isolated by, for example, gel 
electrophoresis. In addition to the stuffer fragment and the plant telomeres, the artificial 
chromosome contains a replication origin and selectable marker that can function in plants to 
allow the circular molecules to be maintained in bacterial cells. The artificial chromosomes 
also include a plant selectable marker, a plant centromere, and a plant ARS to allow replication 

25 and maintenance of the DNA molecules in plant cells. Finally, the artificial chromosome 
includes several unique restriction sites where additional DNA sequence inserts can be cloned. 
The most expeditious method of physically constructing such an artificial chromosome, i.e., 
ligating the various essential elements together for example, will be apparent to those of 
ordinary skill in this art. 
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A number of artificial chromosome vectors have been designed by the current inventors 
and are disclosed herein for the purpose of illustration (FIGs. 7A-7H). These vectors are not 
limiting however, as it will be apparent to those of skill in the art that many changes and 
alterations may be made and still obtain a functional vector. 

All of the compositions and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 
compositions and methods and in the steps or in the sequence of steps of the method described 
herein without departing from the concept, spirit and scope of the invention. More specifically 
«t will be apparent that certain agents which are both chemically and physiologically related 
may be substituted for the agents described herein while the same or similar results would be 
ach,eved. All such similar substitutes and modifications apparent to those skilled in the art are 
deemed to be within the spirit, scope and concept of the invention as defined by the appended 
claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(A) NAME: Arch Development Corporation 

(B) STREET: 1101 East 58th Street 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60637 

(A) NAME : Greg Copenhaver 

<B) STREET: 1028 S. Elmwood 

(C) CITY: Oakpark 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60304 

(A) NAME : Daphne Preuss 

(B) STREET: 5807 S. Dorchester #2E 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 60637 

(ii) TITLE OF INVENTION: PLANT ARTIFICIAL CHROMOSOME COMPOSITIONS AND 
METHODS 

(iii) NUMBER OF SEQUENCES: 44 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/073,741 

(B) FILING DATE: 05-FEB-1998 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/048,451 

(B) FILING DATE: 03-JUN-1997 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 
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CGCCAAAGAC TACGAAATGA TC 

(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 
ATAATAGATA AAGAGCCCCA CAC 

(2) INFORMATION FOR SEQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGTCTGGTT ATGCCGTGAA G 

(2) INFORMATION FOR SEQ ID NO: 4 : 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTTTTACTTA GTCCAATGGT AG 

(2) INFORMATION FOR SEQ ID NO: 5 : 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AAATGGCCAA CGATCAGAAG AATA 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAAGTCCGGC ATGTTATCAC CCAAG 25 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAAGTCGCAA ACGGAAAATG 20 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAACTACGCC TAACCACTAT TCTC 24 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GAAGTACAGC GGCTCAAAAA GAAG 



24 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTGCTGCCAT GTAATACCTA AGTG 

24 

(2) INFORMATION FOR SEQ ID NO: 11 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11 . 
GGCTACTGGT CAAATCAAAT CATTC 



25 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAATCTTTGC AAACGAGTGG 

(2) INFORMATION FOR SEQ ID NO: 13 . 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 . 
GCGGCTGGAT GATCTCCACC TC 



20 



22 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTACCCCGCA GGAAAAAGTA TG 22 



(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ACTTCATCAC TTGCGGGACT G 21 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGCCCAAGAA GCCCACAACA C 21 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GAGGATGTCC AAGGCGTCTG TGC 



23 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GGGGATTTCG GTTGTCTGTC 



(2) INFORMATION FOR SEQ ID NO : 19 . 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID 
TAACGTTCCC ACATGAGC 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID 
AACTCTGTAC CTGCTGGA 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION; SEQ ] 
AGTCGATGTC TAGGCTCTTC 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTTCCATTTC TTGATTTAGT TC 22 



(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ACTAAGGCCT GTGTTGATGT TTCTC 25 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AACCGCTTCC CATTCGTCTT C 21 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



GGCGACCTTG GACCTGTATA CG 



22 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AACCGCCATT TTCATTTCTA TC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TCTGCGGTGG GAATACAAAA G 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 28: 
GCCATCATTC CCCGGTTCTC ATAAG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CCCCTCCCGC CCTAAACCTA C 



22 



21 



25 



21 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TTCCGCTACA TGGCCTTCTA CCTTG 25 



(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGTATTCCCC TGAAAAGTGA CCTG 24 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ACATCCGGCC TTCCCATTG 19 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



TTTGCACCGC CTATGTTACC 



20 



WO 98/55637 



84 

(2) INFORMATION FOR SEQ ID NO: 34 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO 
GAGGACGTTT TGCAGAGTG 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ] 
AGTGGACGCC TTCTTCAATG TG 



(2) INFORMATION FOR SEQ ID NO: 36 - 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID 
TGGTCCGTCG TAGGGCAAC 



(2) INFORMATION FOR SEQ ID NO: 37 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ] 
CAAAACCAAA TCCGCGAAGA AC 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AGTGGCCAGC CTTCTTAACA TACC 24 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCGGGGAAGA AAGCGTGAAT C 21 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCCGGAGTTA CAGCCCTTAT GATG 24 



(2) INFORMATION FOR SEQ ID NO : 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



TTGGGGGATT GGTCAGAAG 



19 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
ATATGTTGCA ACTTAGAATC AG 

22 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TCTCGTTCTG ATGGCTCCTG TG 

22 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GTGTAACCGG TGATACTCTC TCGCC 

25 
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CLAIMS 

1. A recombinant DNA construct comprising a functional Arabidopsis thaliana 
centromere. 

2. The recombinant DNA construct of claim 1, which additionally comprises a telomere 
5 functional in plants. 

3. The recombinant DNA construct of claim 2, wherein the telomere is an Arabidopsis 
thaliana telomere. 

4. The recombinant DNA construct of claim 2, wherein the telomere is a yeast telomere. 

5. The recombinant DNA construct of claim 1, which additionally comprises an 
10 autonomous replicating sequence (ARS) functional in plants. 

6. The recombinant DNA construct of claim 5, wherein said ARS is an Arabidopsis 
thaliana ARS. 

7. The recombinant DNA construct of claim 1 , which additionally comprises a selectable 
marker gene. 

15 8. The recombinant DNA construct of claim 7, which is capable of being maintained as a 
chromosome, wherein said chromosome is stably transmitted in dividing cells. 

9. A plasmid comprising the recombinant DNA construct of claim 7. 

10. The plasmid of claim 9, wherein said plasmid further comprises an origin of replication 
and a selection marker that function in bacteria. 

20 11. The plasmid of claim 1 0, wherein said bacteria is E. coli. 

12. The plasmid of claim 9, wherein said plasmid further comprises an origin of replication 
and a selection marker that function in yeast. 
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13. The plasmid of claim 12, wherein said yeast is S. cerevisiae. 
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24. The construct of claim 14, wherein said structural gene is selected from the group 
consisting of a hormone gene; an enzyme gene; an interleukin gene; a clotting factor gene; a 
cytokine gene; an antibody gene; and a growth factor gene. 

25. The construct of claim 24, wherein said construct is capable of expressing said desired 
5 gene sequence. 

26. The construct of claim 25, wherein said construct is capable of expressing said desired 
gene in a prokaryote. 

27. The construct of claim 25, wherein said construct is capable of expressing said desired 
gene in a eukaryote. 

10 28. The construct of claim 27, wherein said eukaryote is a higher eukaryote. 

29. The construct of claim 28, wherein said higher eukaryote is a plant. 

30. A host cell transformed with the construct of claim 14. 

3 1 . The host cell of claim 30, wherein said cell is a prokaryotic cell. 

32. The host cell of claim 30, wherein said cell is a eukaryotic cell. 
15 33. The host cell of claim 32, wherein said cell is a yeast cell. 

34. The host cell of claim 32, wherein said cell is a higher eukaryotic cell. 

35. The host cell of claim 34, wherein said cell is a plant cell. 

36. A plant artificial chromosome vector comprising an autonomous replicating sequence, 
two telomere sequences, a first structural gene, an Arabidopsis thaliana centromere sequence, 

20 and at least one selectable marker. 
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37. The p la nt anificia, chromosome vector of claim 36, wherein said te l omer e sequences 
are Arabidopsis thaliana telomere sequences. 

38. The plant artificial chromosome vector of H»im u ■ 
areyeasttelomereseouences. 

5 39. Elated nuclei 

2 The isolated nucleic acid sequence of claim 39, narther defined as comprising an 
Arabidopsis thaliana chromosome 1 centromere. 

L d rr Iated nUCldC S6qUenCe ° f C,3im 4 °' Wh6rein S3id acid sequence is 

flanked by the genetic markers mi342 and T27K12. 

'0 42. The isolated nucleic acid sequence of claim 41 »,h • ^ 

filrt . . . J „ ^uenceot claim 41, wherein said nucleic acid sequence is 

further defined as flanked by the genetic markers T22C23 and T27K12. 

43 The isolated nucleic acid sequence of claim 39, fiinher defined as comprising an 
Arabidopsis thaliana chromosome 2 centromere. 

15 L Hrr Iated OUC,eiC ^ SCqUenCe ° f daim 43 ' Wherdn ^ ™«c -*! — is 
15 flanked by the genetic markers mi310 and g4133. 



45. The isolated nucleic acid 
further defined as flanked by a pair of 



sequence of claim 44, wherein said nucleic acid sequence is 



gCnetlC markers se,ected from the group consisting of 
-310 and mi421, mi421 and g4532, g4532 and SEP2A, and SEP2A and g 4.33. 

20 ^ iSOlat6d nUCldC ^ S6qUenCe ° f C,3im 39 ' defi -d as comprising an 

20 ^^rt^^ mpnsingan 

47. The isolated nucleic acid sequence of claim a< u ■ 

flfln ^ K t . q 1 ° laim 46 ' wherein said nucleic acid sequence is 

flanked by the genetic markers atpox and ve02 1 . 
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48. The isolated nucleic acid sequence of claim 46, wherein said nucleic acid sequence is 
flanked by a pair of genetic markers selected from the group consisting of atpox and zim2, zim2 
and mi79b, mi79b and RCEN3, RCEN3 and ASD, ASD and a-1, a-1 and t04109, and t04109 
and ve021. 

5 49. The isolated nucleic acid sequence of claim 39, further defined as comprising an 
Arabidopsis thaliana chromosome 4 centromere. 

50. The isolated nucleic acid sequence of claim 49, wherein said nucleic acid sequence is 
flanked by the genetic markers mi233 and mi 167. 

5 1 . The isolated nucleic acid sequence of claim 49, wherein said nucleic acid sequence is 
10 flanked by a pair of genetic markers selected from the group consisting of mi233 and g2616, 

g2616 and mi306, mi306 and m506, m506 and ngal2, nga!2 and BIO200, BIO200 and mi87, 
mi87 and CI 1 Ath, CI 1 Ath and m456A, and m456A and mil67. 

52. The isolated nucleic acid sequence of claim 39, further defined as comprising an 
Arabidopsis thaliana chromosome 5 centromere. 

15 53. The isolated nucleic acid sequence of claim 52, wherein said nucleic acid sequence is 
flanked by the genetic markers mi233 and mi 167. 

54. The isolated nucleic acid sequence of claim 52, wherein said nucleic acid sequence is 
flanked by a pair of genetic markers selected from the group consisting of mi233 and g2616, 
g2616 and mi306, mi306 and m506, m506 and ngal2, ngal2 and BIO200, BIO200 and mi87, 

20 mi87 and CI 1 Ath, CI 1 Ath and m456A, and m456A and mi 167. 

55. A method of expressing a first foreign gene in a plant comprising transforming a 
regenerable plant cell with the plant artificial chromosome of claim 36. 

56. The method of claim 55, further comprising expressing a second foreign gene. 

57. The method of claim 56, further defined as comprising expressing a third foreign gene. 
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58. The method of claim 57 fiirthcr ,w;„^ 

' defined 33 com P™ng expressing a fourth foreign gene . 

59. A plant produced by the method of claim of claim 55. 

z. :rr^ - — *— • — ■ - 
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